Beefy Boxes and Bandwidth Generously Provided by pair Networks
good chemistry is complicated,
and a little bit messy -LW

Re^6: Extract sequence of UC words?

by BrowserUk (Pope)
on Aug 19, 2008 at 06:31 UTC ( #705145=note: print w/replies, xml ) Need Help??

in reply to Re^5: Extract sequence of UC words?
in thread Extract sequence of UC words?

I upvoted your post above, but still your regex m/(\b(?:[A-Z]+(?:\s+[A-Z]+)*)+\b)/g made me squirm. Whenever I see sequences of nested quantifiers like that:+)*)+ I get uncomfortable, remembering various pathelogical cases I've constructed in the past.

To that end, I thunk again, and came up with this which I believe meets the 'spec', whilst avoiding the nested quantifiers;

m[ ( \b [A-Z] (?: [A-Z\s]* [A-Z] )? \b ) ]gx

Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
"Science is about questioning the status quo. Questioning authority".
In the absence of evidence, opinion is indistinguishable from prejudice.

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://705145]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others studying the Monastery: (11)
As of 2016-10-21 15:48 GMT
Find Nodes?
    Voting Booth?
    How many different varieties (color, size, etc) of socks do you have in your sock drawer?

    Results (289 votes). Check out past polls.