Beefy Boxes and Bandwidth Generously Provided by pair Networks
There's more than one way to do things
 
PerlMonks  

Re: most efficient regex to delete duplicate words

by Anonymous Monk
on Aug 14, 2001 at 00:33 UTC ( [id://104585]=note: print w/replies, xml ) Need Help??


in reply to most efficient regex to delete duplicate words

A simple start.

$string =~ s/(\w+)(\W\1)*/\1/g;

DelPapa

Replies are listed 'Best First'.
Re: Re: most efficient regex to delete duplicate words
by coolmichael (Deacon) on Aug 14, 2001 at 06:41 UTC
    I'm still in the process of learning regexes, but I would think that

    $strint=~s/(\w+)(?:\W\1)+/\1/g;

    would be a little faster, as you don't need to store the second parenthesis match thingy, and you don't need to match zero or more copies of the first match, you need to match one or more.

    Can someone please correct me or confirm this thought? Thanks.

    Michael

    PS: I've also been thinking about the /o modifier. Would this be a good place to use it?

      PS: I've also been thinking about the /o modifier. Would this be a good place to use it?
      $strint=~s/(\w+)(?:\W\1)+/\1/g;

      Nope. And heres why.

      The /o modifier relates specifically to interpolated variables inside of the regex. For instance, the regex:

      m/$matchvar/

      Is a candidate for the /o modifier, since it will be built from the value of $matchvar. With the /o modifier, the resulting "regex engine" will be cached for future use. The next time we run across this regex, we'll use the same engine, even if $matchvar has changed.

      The common analogy is that the /o modifier is like a promise. You promise not to change any of the variables in the regex, in return for better performance. If you break your promise, your program will break as well.

      Your original regex has no interpolated variables, so the regex is only compiled once and we use that copy everytime regardless of the /o modifier.

      In other words, the /o modifier would be redundant on your regex.

      -Blake

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://104585]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others wandering the Monastery: (7)
As of 2024-03-28 18:56 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found