Along the same lines, you could always simply sort the lines into two different output files if you are worried about record keeping. Once you decide how you want to filter/screen the data (probably by one of the methods discussed in the previous posts), you could add a simple if-else check that sends the 'consistent' data lines to a file named "good_data.csv" and the 'inconsistent' data lines to a file named "discarded_data.csv" (or whichever names you like). That way you can review the discarded data later if need be, or recover any lines that were filtered incorrectly. It also might help you find errors or bugs in your code as you test it more thoroughly.
I am thinking of something along these lines (in case you aren't terribly familiar with the syntax):
# Three-argument open with lexical filehandles
open my $in,   '<', 'name_of_input_file'  or die $!;
open my $good, '>', 'good_data.csv'       or die $!;
open my $bad,  '>', 'discarded_data.csv'  or die $!;

while (my $line = <$in>) {
    chomp $line;
    if ( data_meets_good_condition($line) ) {
        print $good "$line\n";
    }
    else {
        print $bad "$line\n";
    }
}

close $in;
close $good;
close $bad;
That's a little rudimentary (and verbose, if you are a fan of golf), and it needs an appropriate logical check in the if condition, but the primary idea is the if-else split: that way you don't wipe out data by accident. Again, in terms of actually filtering the data, some of the methods discussed above are probably better.
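For the condition itself, here's one minimal sketch, just as a placeholder. It assumes the 'good' test is simply that each line splits into an expected number of comma-separated fields; the sub name and field count are made up, so swap in whatever screening rule you settled on from the earlier posts:

```perl
# Hypothetical filter: a line is 'good' if it has exactly
# $EXPECTED_FIELDS comma-separated values. Replace with your real check.
my $EXPECTED_FIELDS = 5;    # assumption -- set to your record width

sub data_meets_good_condition {
    my ($line) = @_;
    # The -1 limit keeps trailing empty fields, so "a,b,," counts as 4
    my @fields = split /,/, $line, -1;
    return scalar(@fields) == $EXPECTED_FIELDS;
}
```

Note that a bare split like this won't handle quoted fields containing commas; if your CSV has those, a module such as Text::CSV is the safer route.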