Beefy Boxes and Bandwidth Generously Provided by pair Networks
Your skill will accomplish
what the force of many cannot
 
PerlMonks  

Re: FASTA Splitter

by perliff (Monk)
on Jun 02, 2009 at 08:55 UTC ( [id://767502]=note: print w/replies, xml ) Need Help??


in reply to FASTA Splitter

try this... something like this should split your fasta file (called bigfasta here) into several small fasta files based on the sequence display id (i assume your sequences have nice looking identifiers). learn to use bioperl to your advantage for reading and writing biological sequence files... its been done by the excellent bioperl project, and you don't want to reinvent the wheel everytime.
use strict; # always... use Bio::SeqIO; my $bigfasta = "bigfasta.faa"; my $seqin = Bio::SeqIO->new(-file => $bigfasta, -format=>"fasta"); while ($inseq = $seqin->next_seq) { my $id = $inseq->display_id; my $outfile = "$id.fasta"; my $seqout = Bio::SeqIO->new(-file=>">$outfile", -format=>"fas +ta"); $seqout->write_seq($inseq); }
----------------------

"with perl on my side"

"If you look at the code too long, the code also looks back at you"

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://767502]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others imbibing at the Monastery: (5)
As of 2024-04-23 21:49 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found