How to split big .faa file into smaller .faa files
I have a 10 gb .faa proteomes file that I want to run MAFFT on. But it is too big and hence I need to divide the file. How do I convert it to smaller files in windows without losing any data ? The solutions I have come across are for a UNIX/LINUX based environment
• 30 views
Seqkit
is the answer.
To split into 100 parts:
seqkit split myfile.faa --by-part 100
To split by number of desired sequences per file (eg 5000 per file):
seqkit split myfile.faa --by-part 5000 -by-size
The solutions I have come across are for a UNIX/LINUX based environment
Yes, use linux, if you want to perform any bioinformatics you need to use linux