comment on

This is definetely a problem to go to CPAN for, but in the interest of wheel-reinvention, here is an implementation of my interpretation of your question. The function below matches "sticky-finger" typos, i.e. it matches strings where any number of characters were duplicated (up to a specifiable number of times). That seemed to be the gist of your question... maybe you could provide strings that shouldn't match, too?

use strict;
use warnings;

my $input = 'foo bar baz';
my @test_strings = (
    'fooo bar baz',
    'foo bar baaz',
    'foo bbar baz',
    'ffoo baar baz', # Matches:  Should it?
    'fxo bar baz',   # No match: Different character, not dup.
    'foo baaar baz'  # No match: Too many 'a's!
);
my $fuzziness = 1;

# Let's try out the function
for (@test_strings){
    if (fuzzy_match($input, $_, $fuzziness)){
        print "MATCHED:  $_\n";
    } else{
        print "NO MATCH: $_\n";
    }
}

sub fuzzy_match{
    my ($input, $test_string, $fuzziness) = @_;

    # Build a regex from the input string
    my $regex = '';
    for (split //, $input){
        $regex .= quotemeta($_) . "\{1," . ($fuzziness + 1) . "}?";
    }

    $test_string =~ m/$regex/;
}
[download]

As for the number of characters different, I think that would be length($test_string) - length($input_string).

In reply to Re: off-by-one string comparison by crashtest
in thread off-by-one string comparison by argv

Are you posting in the right place? Check out Where do I post X? to know for sure.
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
Want more info? How to link or How to display code and escape characters are good places to start.


There's more than one way to do things
	PerlMonks