Remove vector sequences from genome database

Remove vector sequences from genome database

1

Hi,

I’m building a database containing Refseq genome sequences from selected bacterial species, which will be used for Nanopore sequencing of environmental samples.

In order to eliminate chances of false positives, I used the UniVec database to locate any potential contamination and got substantial hits to several vectors.
I am pretty new to bioinformatics and therefore I wanted to hear if anyone has any ideas of how to mask/remove the contamination from the genome sequences?

/Helena


refseq


univec


vector


database


contamination

• 41 views

updated 59 minutes ago by

108k

written 2 hours ago by

▴

10

Read more here: Source link