Beefy Boxes and Bandwidth Generously Provided by pair Networks
Keep It Simple, Stupid
 
PerlMonks  

Re^5: Best way to search file

by Marshall (Canon)
on Apr 16, 2015 at 21:02 UTC ( [id://1123663]=note: print w/replies, xml ) Need Help??


in reply to Re^4: Best way to search file
in thread Best way to search file

One way to strip out the "-" characters is like this:
#!usr/bin/perl use strict; use warnings; foreach my $ssn qw(123-45-6789 987654321) { my $digits = $ssn; $digits =~ s/-//g; print "$ssn \t$digits\n"; } __END__ prints: 123-45-6789 123456789 987654321 987654321
I am not sure of the best way to handle this "sometimes field 2 vs 4" without seeing a few example lines of these databases. Don't post any real SSNs!

As mentioned before, your HUGE performance gain will come by processing each of the 2 files only once. Process file 2 first to make a memory structure, then process file 1 line by line. Each file only should be read once.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://1123663]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others exploiting the Monastery: (9)
As of 2024-04-19 07:56 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found