note
ryan
You can grab all the possible values for intron and exon with your regex and then split them up.
<BR><BR>
Consider replacing your intron/exon elsif blocks with this:
<CODE>
#new intron elsif block
elsif (/\s+\/intron="(.+)"\n/) {
    # $1 holds everything between the double quotes; split it on ';'
    foreach my $item (split(/;/, $1)) {
        print OUT "Intron\t $item\n";
    }
}
</CODE>
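I replaced all the *s with +s. A + requires at least one character, so the pattern can no longer match an empty value; from my understanding it is also a bit more efficient, but I'm no regex guru :) The regex puts everything between the double quotes in $1, and split then breaks that string on each semicolon.<BR><BR>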
This will print out, based on your input data:
<CODE>
Intron 1-48
Intron 334-385
</CODE>
Now that they are separated, you can do whatever you want with them.<BR><BR>
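If you want to try the idea outside your script, here is a minimal standalone sketch. The helper name split_feature and the sample line are my own assumptions (I'm guessing your input looks like /intron="1-48;334-385"), so adjust both to your real data. It prints to STDOUT rather than your OUT filehandle.

```perl
use strict;
use warnings;

# Hypothetical helper (not from the original script): returns the list of
# ranges found in one line for the given feature name ('intron' or 'exon').
sub split_feature {
    my ($line, $feature) = @_;

    # Capture everything between the double quotes; no match means no values.
    return () unless $line =~ /\s+\/\Q$feature\E="(.+)"/;

    # Break the captured string on semicolons.
    return split /;/, $1;
}

# Assumed sample line; the real input format may differ.
my $line = qq{     /intron="1-48;334-385"\n};

print "Intron\t$_\n" for split_feature($line, 'intron');
```

Passing the feature name in means the same sub should cover both the intron and the exon case, so the two elsif blocks could share it.<BR><BR>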
Ryan
142906
142906