Re: FASTA Splitter


	PerlMonks


by perliff (Monk)

on Jun 02, 2009 at 08:55 UTC ( [id://767502] )


in reply to FASTA Splitter

Try something like this. It splits your FASTA file (called bigfasta here) into several small FASTA files, one per sequence, each named after the sequence's display ID (I assume your sequences have nice-looking identifiers). Learn to use BioPerl to your advantage for reading and writing biological sequence files. The excellent BioPerl project has already done this work, and you don't want to reinvent the wheel every time.


use strict;    # always...
use warnings;
use Bio::SeqIO;

my $bigfasta = "bigfasta.faa";
my $seqin = Bio::SeqIO->new(-file => $bigfasta, -format => "fasta");

# Write each sequence to its own file, named after its display ID.
while (my $inseq = $seqin->next_seq) {
        my $id = $inseq->display_id;
        my $outfile = "$id.fasta";
        my $seqout = Bio::SeqIO->new(-file => ">$outfile", -format => "fasta");
        $seqout->write_seq($inseq);
}
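The code above assumes the display IDs are usable as file names. If yours contain characters such as "|" or "/" (common in NCBI-style headers), something like the following might help. It is only a sketch: safe_filename is a hypothetical helper of my own, not part of BioPerl, and the choice of "_" as the replacement character and "unnamed" as the fallback are assumptions.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of BioPerl): turn a sequence ID into a
# file name that is safe on most filesystems.
sub safe_filename {
    my ($id) = @_;
    $id = 'unnamed' unless defined $id && length $id;

    # Replace each run of characters other than letters, digits,
    # dot, underscore and hyphen with a single underscore.
    (my $name = $id) =~ s/[^A-Za-z0-9._-]+/_/g;

    # Trim underscores left at either end by the substitution.
    $name =~ s/^_+|_+$//g;

    # Fall back to a placeholder if nothing usable remains.
    $name = 'unnamed' unless length $name;

    return "$name.fasta";
}

print safe_filename('gi|129295|sp|P01013|OVAX_CHICK'), "\n";
# prints "gi_129295_sp_P01013_OVAX_CHICK.fasta"
```

In the loop you would then write my $outfile = safe_filename($inseq->display_id); instead of building "$id.fasta" directly. Note that two different IDs can map to the same file name after cleaning, so the later sequence would overwrite the earlier one unless you check for that.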

----------------------

"with perl on my side"

"If you look at the code too long, the code also looks back at you"
