Removing headlines in fasta file in python
I want to remove the headlines of a fasta file that contains multiple protein sequences. I need to count the amino acid numbers.
>Os12t0641500-03 Similar to RecF/RecN/SMC N terminal domain containing protein, expressed. MAAAAAGKGGGGQGRIHRLEVENFKSYKGTQTIGPFFDFTAIIGPNGAGKSNLMDAISFV LIKVPLL* >Os12t0597800-01 Similar to Helix-loop-helix DNA-binding domain containing protein, expressed. MMSFPYSSGDLGEATTAAAAAVDMITLDQMFRDYDASTGDDLFELVWESCGGGEIDSGAG LGRQ* >Os12t0598600-00 Similar to H0315A08.1 protein. MKRSMNYSGIECFTFGDDNKLRIFPPNSYKFKPKDHIILDEVQECILDNFWYQYNNKREE FSDLDTMDLGGHGQPDE*
• 22 views
Read more here: Source link