|laziness, impatience, and hubris|
Re: Remove duplicate entriesby PyrexKidd (Monk)
|on Nov 17, 2010 at 06:55 UTC||Need Help??|
hehe... I see now where:
So here is what I came up with.
this works great if the search key is repeated. what if I have a key that is misspelled etc. i.e.:
where the first part of the name is correct but there is potentially more junk at the end of the name. is there a way to match part of the string and if part of the string matches call it a dup?
the problem is the incoming data isn't consistent. ie there are ten cols across in the CSV, of the ten cols between 4 and 10 are filled in, so comparing the data is not a viable method for sorting DUP entries.
Stylistically, I've always used all caps to represent files, besides STDERR and STDOUT are just glorified file handles anyway, and they use full caps. I understand that lexically scoped file handles are not global variables and that's the differentiation you make--some habits.
Again, thanks for the assistance.