|Problems? Is your data what you think it is?|
Perl version of 'diff'?by ninja_byte (Acolyte)
|on Jul 19, 2004 at 22:16 UTC||Need Help??|
ninja_byte has asked for the
wisdom of the Perl Monks concerning the following question:
I have a database filled with about 200,000+ records. Each one has information - domain name, username, host.
This is *not* a dynamic database - I have to manually go through each one of the hosts, extract a master domain list, and parse it from there...
The initial challenge was to get all of the information into a database - that has been handled(albeit crudely).
Now I'm faced with the task of keeping it updated every few days/weeks...
So lets say I have 'host01-original.txt', and I can get 'host01-updated.txt' at any given time. Any suggestions as to the best way of finding new additions to the file only?
The brute force method of a huge 'grep' loop seems a bit distasteful at present.
The file is currently in the format: