<?xml version="1.0" encoding="windows-1252"?>
<node id="1016073" title="Re: PROTEIN FILE help me pleaseee" created="2013-01-30 09:58:52" updated="2013-01-30 09:58:52">
<type id="11">
note</type>
<author id="918402">
2teez</author>
<data>
<field name="doctext">
&lt;p&gt;
Hi [serafinososi],&lt;br&gt;&lt;br&gt;
 &lt;i&gt;...If the line starts with “&gt;” (it is the first line of a FASTA file) the line is not considered...&lt;/i&gt;&lt;br&gt;&lt;br&gt;
What if the line that starts with "&gt;" is more than one in the file what happens?&lt;br&gt;&lt;br&gt;
 If I understand the OP's question, using the data provided, if I may suggest (adding to what others have said) using perl function [doc://split] may do like so:
&lt;c&gt;
use warnings;
use strict;

my $protein;

while (&lt;DATA&gt;) {
    if (/^&gt;/) {
        next;
    }
    else {
        $protein = join '', split;
    }
}

my $number_of_F = grep { /F/ } split //, $protein;

print "The aminoacid sequence: ", $protein, " contains ", $number_of_F,
  " Phenylalanine aminoacids", $/;

__DATA__
&gt;gi|403369491|gb|EJY84591.1| Transcriptional regulator, Sir2 family protein Oxytricha trifallax

MMKQLIKHNKNTPLFNFLRVKFSSTAATIQTQQTVNKPIESKFKEEKLDNYHDIYEKSKRLAEQISQSKS FICFTGAGLSTSTGIPDYRSTSNTLAQTGAGAYELEISEEDKKSKTRQIRSQVQRAKPSISHMALHALME NGYLKHLISQNTDGLHLKSGIPYQNLTELHGNTTVEYCKSCSKIYFRDFRCRSSEDPYHHLTGRQCEDLK CGGELADEIVHFGESIPKDKLVEALTAASQSDLCLTMGTSLRVKPANQIPIQTIKNKGQLAIVNLQYTPF DEIAQIRMHSFTDQVLEIVCQELNIKIPEYQMKRRIHIIRNAETNEIVVYGSYGNHKNIKLSFMQRMEYI DNKNHVYLALDKEPFHIIPDYFNFQNINTDQEEVEFRIHFYGHNSEPYFQLTLPRQSILELQAGEHLICD ITFDYDKLEWK
&lt;/c&gt;
&lt;/p&gt;
&lt;!-- Node text goes above. Div tags should contain sig only --&gt;
&lt;div class="pmsig"&gt;&lt;div class="pmsig-918402"&gt;
If you tell me, I'll forget.&lt;br&gt;
If you show me, I'll remember.&lt;br&gt;
if you involve me, I'll understand.&lt;br&gt;
--- Author [unknown to me]
&lt;/div&gt;&lt;/div&gt;</field>
<field name="root_node">
1016025</field>
<field name="parent_node">
1016025</field>
</data>
</node>
