Generate hashes for all sequences in a FASTA file

Generate hashes for all sequences in a FASTA file

1

Hello!

I am working on novel transcripts assembled from RNA-Seq data, using Stringtie. However, since stringtie “MSTRG” ids are poorly conserved across runs, I wanted to implement a strategy that converts all transcript sequences in a FASTA file to a sequence-specific hash, which can then be used as part of the header for identification of the transcripts.
Any and all help is appreciated.

Thanks!


stringtie


RNA


hash

• 44 views

Read more here: Source link