in reply to
first two words-pattern matching
your file seems to be a csv file. Wich is not seperated by commas but by tabs. But the seperator is not important.
Maybe the Text::CSV module or other modules about csv can help you as well.
The same name on every site (unless it is taken already)
A different name for every site
The same name for sites I don't care about, but different names on sites I do care about
Different names for sites I don't care about, but the same name on sites I do care about
The anonymous account
Someone else's username
Root, after hacking into the site
Results (254 votes),