Counting number of mutations per read compared to reference
Apologies if question is basic, but I’m very new to NGS analysis!
I have some Miseq DNA sequencing data (in .fastq format) which I can align to sam/bam. What I would like to get from this data is how many mutations (if any) there are for every read, and ideally also the type of mutations present in every read. In other words, some quantitative measure of sequence similarity. What’s the easiest way of getting such data? The objective is to assess random mutagenesis efficiency.
As an extra, I’d like to get same data but after translation of my sequence into protein (to detect nonsense/silent mutations etc.
• 22 views
Read more here: Source link