http://www.perlmonks.org?node_id=1172911


in reply to Re: Finding Nearly Identical Sets
in thread Finding Nearly Identical Sets

herveus,

It isn't a string of digits though you could think of them that way.

Have you ever tried executing the Levenshtein edit distance a trillion times? Even the XS version isn't that fast. Let's say I get 2 million messages a day and I have 500K different sets/strings to compare against - this isn't the way to go.

Cheers - L~R