Extract specific reads from fastq or SRA

Extract specific reads from fastq or SRA

0

Hi everyone.,

I found one data of my interest from SRA. But the file is too big.

Is it possible to obtain specific reads from SRA (preferably) based on sequence string of my gene or region of interest?

Also if it is possible to do it with fastq file would be manageable?

Basically I want to avoid aligning full data which could be time-consuming and memory intensive. So I want to focus only reads of my interest.

Note:

  1. I tried seqtk but it require read name and I don’t know which read name to take.

  2. grepping the sequence is another option but I would lose fastq based information

That’s why I am looking for tools based on read subset/seed sequence. eg. AAATTCGC

I would appreciate any help.

Thank you


reads


filter


fastq


sra

• 23 views

updated 1 hour ago by

108k

written 2 hours ago by

▴

290

Read more here: Source link