Help me understand the Nanopore fastqc results

Help me understand the Nanopore fastqc results



I have got my first Nanopore sequencing data and the first step was to see if the data is good. Has anyone has any experience with this kind of data and can tell me how to interpret the results.

The whole report can be downloaded here (not sure how to post it here). Allin all it looks quite good to me, but what I’m not sure about are the two images attached here. These are the per base QC and sequencing content. It seems that the beginning of the reads is not good, but this can be probably trimmed by removing adapters etc. But Is it really that i need to remove the first 1000 positions? This seems a bit extreme.

About the sequencing content, I don’t know where to begin. This looks consistent with the first image, where the quality is not good, and if removing these positions, it should get better. but is it ok for the two pairs TG and AC to be so much apart like that?


I have done both pycoQC as well as the minionQC run (R). They show similar results. It can be downloaded from here
a few of the images are also attached.

Basecalled reads PHRED quality
Output over experiment time
Basecalled reads length vs reads PHRED quality
Channel activity over time

To me it looks as if the run was past its prime after ~50h. There is no real gain of new reads afterwards. As I’m usually work with mRNA-Seq, I’m not sure how to call the PHRED Quality. I can see that the most reads are of Q ~=8 and most of them are short (which is also expected). But all in all can this run be classified as good?


per base QC
per base Seq content





Read more here: Source link