in reply to Search for similar strings - to standardise
I would begin by Super Searching for "edit distance". The nodes "word similarity measure" and "Creating Dictionaries" may also be helpful, as may the work of Ted Pedersen, in particular his Ngram Statistics Package.
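
To give a feel for what an edit distance measure does, here is a minimal sketch of the Levenshtein distance, used to map a misspelled string onto the nearest entry in a list of standard spellings. It is written in Python only for illustration. The function names (edit_distance, closest) and the sample strings are my own assumptions, not anything taken from the original question, and a real solution would probably want a distance threshold as well.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: the minimum number of single-character
    insertions, deletions, and substitutions needed to turn a into b."""
    # prev[j] holds the distance between the first i-1 chars of a
    # and the first j chars of b; start with the empty prefix of a.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute (or match)
            ))
        prev = curr
    return prev[-1]


def closest(word: str, canonical: list[str]) -> str:
    """Hypothetical helper: return the canonical spelling with the
    smallest case-insensitive edit distance to word."""
    return min(canonical, key=lambda c: edit_distance(word.lower(), c.lower()))


if __name__ == "__main__":
    # Illustrative data only.
    standard = ["United States", "United Kingdom"]
    print(closest("Untied States", standard))
```

Comparing every input against every standard spelling is quadratic, so for a large dictionary you would want to narrow the candidates first (for example by shared n-grams) before computing distances.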
HTH,
planetscape