Re^2: Search for similar strings - to standardise
by educated_foo (Vicar) on Oct 31, 2009 at 03:33 UTC
Edit distance would be useful for comparison, but not so much for clustering strings with their common misspellings. For that, you might try n-grams (mentioned above) or locality-sensitive hashing (basically the same thing, but with gaps).
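
To make the n-gram suggestion concrete, here is a rough sketch in Python (chosen only for illustration; the same idea ports directly to Perl hashes). It assumes character trigrams, Jaccard similarity on the trigram sets, and a greedy one-pass grouping against each cluster's first member. The function names `ngrams`, `jaccard`, and `cluster`, the padding scheme, and the 0.3 threshold are all my own assumptions, not anything from the thread, and the threshold in particular would need tuning on real data.

```python
def ngrams(s, n=3):
    """Return the set of character n-grams of s (lowercased, space-padded)."""
    # Padding gives the first and last characters their own grams,
    # so short strings still produce a useful set.
    padded = " " * (n - 1) + s.lower() + " " * (n - 1)
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets: size of intersection over size of union."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cluster(strings, threshold=0.3, n=3):
    """Greedy single-pass grouping (an assumed, simplistic strategy).

    Each string joins the first cluster whose representative (its first
    member) is at least `threshold` similar; otherwise it starts a new one.
    """
    clusters = []  # list of (representative_grams, members)
    for s in strings:
        grams = ngrams(s, n)
        for rep_grams, members in clusters:
            if jaccard(grams, rep_grams) >= threshold:
                members.append(s)
                break
        else:
            clusters.append((grams, [s]))
    return [members for _, members in clusters]


names = ["Microsoft", "Microsft", "Mircosoft", "Oracle", "Orcale"]
print(cluster(names))
```

Note that a transposition in a short string ("Oracle" vs "Orcale") destroys a large share of its trigrams, which is why the threshold here is set fairly low. Comparing every string against every cluster is still quadratic in the worst case; locality-sensitive hashing is the usual way to avoid that by only comparing strings that land in the same bucket.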