PerlMonks  

Re^2: Regarding the new \w regexp escape in 5.11

by ajs (Initiate)
on Oct 05, 2009 at 20:16 UTC (#799339)


in reply to Re: Regarding the new \w regexp escape in 5.11
in thread Regarding the new \w regexp escape in 5.11

If we're going to make this change (which appears to be compatible with other modern Unicode-handling regex engines such as Python's and PCRE), we should at least provide a way out for the user who wants true Unicode support without having to jump through lots of hoops. Python, for example, does this with (?u). Since Perl 5 maps (?letter) to the modifier letters, it seems obvious to make this a modifier, :u, which should probably be turned on by default with "use locale".

Doing that gives the expected behavior for POSIX-friendly uses and yet avoids snubbing users of P5 regexes who routinely match text from other languages/regions.
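
To make the proposal concrete, here is a minimal sketch of how such a switch could look from the user's side. It assumes a perl that accepts the inline (?u) and (?a) modifiers (as far as I know that means 5.14 or later, so nothing that existed when this thread started); the helper names are made up for illustration.

```perl
use strict;
use warnings;

# Hypothetical helper: first run of \w characters under Unicode rules.
# Assumes the inline (?u) modifier is available (perl 5.14 or later).
sub word_unicode {
    my ($text) = @_;
    return $text =~ /(?u)(\w+)/ ? $1 : undef;
}

# Hypothetical helper: first run of \w characters under ASCII-only rules.
# Assumes the inline (?a) modifier is available (perl 5.14 or later).
sub word_ascii {
    my ($text) = @_;
    return $text =~ /(?a)(\w+)/ ? $1 : undef;
}

# "naivete" spelled with i-diaeresis (U+00EF) and e-acute (U+00E9),
# written as escapes so the source file itself stays plain ASCII.
my $text = "na\x{EF}vet\x{E9}";

printf "Unicode rules matched %d characters\n", length word_unicode($text);
printf "ASCII rules matched %d characters\n",   length word_ascii($text);
```

The point of the sketch is only that the user picks the semantics explicitly in the pattern, rather than having them depend on how the string happens to be stored.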

naïveté (n) - Assuming your experiences map cleanly to the set of all experiences....


Re^3: Regarding the new \w regexp escape in 5.11
by demerphq (Chancellor) on Oct 06, 2009 at 09:54 UTC

    I'm wondering if you somehow dropped a "not" in your first parenthetical remark.

    The fundamental problem here is that \w behaves differently depending on whether the string is utf8 or not. We want to make it so \w does the same thing regardless. That means that we end up breaking someone's code. I really don't want to have to support three modes: one for the current broken behaviour, one for utf8 and one for ascii. I would much rather just support one mode, and have it cover all the bases. Whether this is feasible going forward isn't clear.
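
    A minimal sketch of the inconsistency, assuming a perl running with its default matching rules (no "use locale", no unicode_strings feature, no modifier on the pattern). The helper name is made up for illustration. The two strings compare equal, yet as far as I know the natively stored one does not match \w while the utf8-upgraded one does.

    ```perl
    use strict;
    use warnings;

    # Hypothetical helper: does the string contain a \w character under
    # the default rules (no modifier on the pattern)?
    sub has_word_char {
        my ($str) = @_;
        return $str =~ /\w/ ? 1 : 0;
    }

    my $bytes = "\xE9";       # e-acute, stored as a single native byte
    my $chars = "\xE9";
    utf8::upgrade($chars);    # same character, now stored internally as utf8

    printf "strings compare equal: %s\n", $bytes eq $chars ? "yes" : "no";
    printf "native string matches \\w: %d\n", has_word_char($bytes);
    printf "utf8 string matches \\w:   %d\n", has_word_char($chars);
    ```

    Same character, same pattern, different answer depending on the internal representation. That is the behaviour we want to get rid of.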

    Feel free to provide more details on how these issues are tackled in other languages.

    ---
    $world=~s/war/peace/g
