Generate hashes for all sequences in a FASTA file
Hello!
I am working on novel transcripts assembled from RNA-Seq data using StringTie. However, since StringTie "MSTRG" IDs are not stable across runs, I want to convert every transcript sequence in a FASTA file to a sequence-specific hash, which can then be used as part of the header to identify the transcripts.
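One possible approach, as a minimal sketch in Python using only the standard library: read each record, hash the upper-cased sequence with SHA-256, and put a truncated digest at the front of the header. The function names (`read_fasta`, `seq_hash`, `rehash_fasta`), the 16-character truncation, and the 60-column line width are illustrative assumptions, not anything StringTie prescribes.

```python
import hashlib
import io


def read_fasta(handle):
    """Yield (header, sequence) pairs from an open FASTA file handle."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts: emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def seq_hash(seq, length=16):
    """Return the first `length` hex characters of the SHA-256 digest.

    The sequence is upper-cased first so soft-masked (lower-case) bases
    hash the same as their upper-case form. `length` is an assumed default.
    """
    return hashlib.sha256(seq.upper().encode("ascii")).hexdigest()[:length]


def rehash_fasta(in_handle, out_handle, width=60):
    """Write each record with its hash prepended to the original header."""
    for header, seq in read_fasta(in_handle):
        out_handle.write(f">{seq_hash(seq)} {header}\n")
        # Re-wrap the sequence at `width` columns.
        for i in range(0, len(seq), width):
            out_handle.write(seq[i:i + width] + "\n")


# Small in-memory demonstration; the MSTRG IDs here are made up.
demo_in = io.StringIO(">MSTRG.1.1\nACGTACGT\n>MSTRG.2.1\nTTGACA\n")
demo_out = io.StringIO()
rehash_fasta(demo_in, demo_out)
print(demo_out.getvalue())
```

For real files, the same function can be called with `open("transcripts.fa")` and `open("hashed.fa", "w")` in place of the `StringIO` objects (file names hypothetical). Two caveats: identical sequences get identical hashes, so duplicate transcripts will share an ID, and a reverse-complemented sequence hashes differently from its forward strand. Truncating the digest keeps headers short but makes collisions slightly more likely; keep the full digest if that matters.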
Any and all help is appreciated.
Thanks!