How to split big .faa file into smaller .faa files

How to split big .faa file into smaller .faa files

1

I have a 10 gb .faa proteomes file that I want to run MAFFT on. But it is too big and hence I need to divide the file. How do I convert it to smaller files in windows without losing any data ? The solutions I have come across are for a UNIX/LINUX based environment


faa


proteomes

• 30 views

updated 1 hour ago by

▴

920

written 2 hours ago by

0

Seqkit is the answer.

To split into 100 parts:

seqkit split myfile.faa --by-part 100

To split by number of desired sequences per file (eg 5000 per file):

seqkit split myfile.faa --by-part 5000 -by-size

The solutions I have come across are for a UNIX/LINUX based environment

Yes, use linux, if you want to perform any bioinformatics you need to use linux


Login
before adding your answer.

Source link