http://www.perlmonks.org?node_id=1061301


in reply to Merge 2 files using key field

Jim,
LanX indicated that the Monastery was a learning site. It is also a social site and a teaching site among other things. While I agree with LanX and Corion that you would have done yourself a favor by first searching to see if this same problem had already been asked, I can imagine that it is possible you did that and didn't understand or found that they didn't work for one reason or another. Unfortunately, you didn't indicate that so we have to assume you didn't bother searching (hopefully lesson learned).

As for your actual problem at hand, you need to state your constraints and your objectives. Below is a hypothetical list:

Had you done what I did above (remember, these are made up constraints/goals), it would have been easy to say "I looked at <some thread> but it won't work for me because of X".

If I take the simplest assumptions (file 1 can fit into memory, order doesn't matter, keys are unique, no lines in file 1 not in file 2) then the solution should be pretty obvious.

  1. Read file 1 into a hash (key = first field, value = second field)
  2. Read file 2 line by line
  3. If first field is not in hash, move on to next line
  4. Else, print out the value in the hash, a comma and field 2
I could even easily work around the limitation if file 1 contains records not in file 2:
  1. Read file 1 into a hash (key = first field, value = second field)
  2. Read file 2 line by line
  3. If first field is not in hash, move on to next line
  4. Else, print out the value in the hash, a comma and field 2
  5. Delete the hash entry just printed
  6. When 2nd file has been read completely, iterate over hash and print out line for each entry (not found in file 2)

I hope this helps you understand how you can help yourself.

Cheers - L~R