How to align samples that were sequenced across multiple flow cells and lanes
Hi,
I have a set of FASTQ files that I want to align to a reference genome. Each sample was sequenced on 2 different runs (flow cells) and on 2 different lanes per run, so I have 4 files per sample. I am not sure when I should merge the files: before or after alignment? Previous posts suggest merging after alignment, but I am not sure which is best in my case. Can I merge the samples using samtools, or do I simply cat one file onto the end of the other?
An example for sample1 is shown below (FC = flow cell, L = lane):
sample1.FC1.L1
sample1.FC1.L2
sample1.FC2.L1
sample1.FC2.L2
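To make the "merge after alignment" option concrete, here is a rough Python sketch of what I understand the previous posts to mean: align each flow cell/lane file separately with its own read group, then merge the sorted BAMs with samtools. It only builds the command strings and does not run anything. The aligner (bwa mem), the reference name ref.fa, the .fastq.gz suffix, single-end reads, and the output file names are all my assumptions for illustration, not something I have settled on.

```python
import shlex


def read_group(sample, flowcell, lane):
    """Build an @RG header string for one flow cell/lane file.

    Tabs are written as a literal backslash-t, which is the form
    bwa mem expects for its -R option.
    """
    fields = [
        "@RG",
        f"ID:{sample}.{flowcell}.{lane}",  # unique per flow cell + lane
        f"SM:{sample}",                    # same sample name in all 4 files
        "PL:ILLUMINA",                     # assumption: Illumina platform
        f"PU:{flowcell}.{lane}",           # platform unit
    ]
    return "\\t".join(fields)


def plan_alignment(prefix, reference="ref.fa"):
    """Return (bam_name, shell_command) for one file such as sample1.FC1.L1.

    Assumes single-end reads stored as <prefix>.fastq.gz (hypothetical
    naming); paired-end data would pass two FASTQ files to bwa mem.
    """
    sample, flowcell, lane = prefix.split(".")
    bam = f"{prefix}.sorted.bam"
    command = (
        f"bwa mem -R {shlex.quote(read_group(sample, flowcell, lane))} "
        f"{shlex.quote(reference)} {shlex.quote(prefix + '.fastq.gz')} "
        f"| samtools sort -o {shlex.quote(bam)} -"
    )
    return bam, command


def plan_merge(sample, bams):
    """Return the samtools merge command that combines the per-lane BAMs."""
    inputs = " ".join(shlex.quote(b) for b in bams)
    return f"samtools merge {shlex.quote(sample + '.merged.bam')} {inputs}"


prefixes = [
    "sample1.FC1.L1",
    "sample1.FC1.L2",
    "sample1.FC2.L1",
    "sample1.FC2.L2",
]

bams = []
for prefix in prefixes:
    bam, command = plan_alignment(prefix)
    bams.append(bam)
    print(command)

print(plan_merge("sample1", bams))
```

The idea is that every file keeps its own read group ID while sharing the same SM tag, so the flow cell and lane of origin stay visible in the merged BAM.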
Thanks a lot in advance!