comment on

Given this little snippet, it appears that the goal of the first loop is to remove all the lines that don't end with 0 and put those into an array. The next loop then reads that array. So, you're iterating over your data twice, but you don't need to. You've got 1 completely copy of the file in memory, and another mostly completely copy. It takes time to create and destroy those copies. You don't need to do that either.

open(IN,$my_file) || die "Can't open: $!";
while (my $line = <IN>) {
    next unless $line =~ /0\s*$/;
    $line =~ /^(\d+.?\d*)//;
    if ($1 > .5) {
        # etc, etc
    }
}
[download]

here's the break down line by line

Open the file
Process each line in turn. So, we're not storing the whole thing in memory (takes time to allocate said memory)
The if $1 == 0 part of your first loop only kept lines that ended in 0 for @zeroat. Just test directly for the line ending in 0 and throw away the rest.
no need for the regexp to catch the last character, we already know it's a 0
Test only the > .5 part, again no need for the $2 == 0 since we already threw away every line that didn't end in 0.
Rest of processing

HTH

/\/\averick
OmG! They killed tilly! You *bleep*!!

In reply to Re: Processing large files many times over by maverick
in thread Processing large files many times over by dimmesdale

Are you posting in the right place? Check out Where do I post X? to know for sure.
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
Want more info? How to link or How to display code and escape characters are good places to start.


No such thing as a small change
	PerlMonks