Beefy Boxes and Bandwidth Generously Provided by pair Networks
Do you know where your variables are?
 
PerlMonks  

Re: Comparing strings (exact matches) in LARGE numbers FAST

by johndageek (Hermit)
on Sep 02, 2008 at 17:01 UTC ( [id://708552]=note: print w/replies, xml ) Need Help??


in reply to Comparing strings (exact matches) in LARGE numbers FAST

Just some idea on how to handle this
sort primary file (100s of millions) split file into ~ 10 milion record sub files named for the last key contained in each file sort secondary file -> ssf1 you may sort multiple secondary files (ssf2, ssf3 etc) perl program read directory containing primary files by name create an array containing the file names (@pfn)(last key in the ar +ray) open ssf1 ssf2 ssf3 ssf4 $ssf1_record=" "; # or a value lower than lowest value $ssf2_record=" "; # or a value lower than lowest value $ssf3_record=" "; # or a value lower than lowest value $ssf4_record=" "; # or a value lower than lowest value foreach $pf (@pfn){ open primary file ($pf) read $pf records into a hash ($pfrh) while ($ssf1_record < $pf){ compare to hash if found{ action } read $ssf1 record; } while ($ssf2_record < $pf){ compare to hash if found{ action } read $ssf2 record; } }

Enjoy!
Dageek

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://708552]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others wandering the Monastery: (5)
As of 2024-04-23 15:11 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found