PerlMonks
Re: Modern best practices for multilingual regexp alphabetical character matching?
by ikegami (Patriarch) on Jan 12, 2009 at 21:40 UTC [id://735814]
Make sure it's decoded using one or more of the following
And make sure the string is stored internally as UTF-8.
(You don't need use utf8; in order to call the utf8:: functions. use utf8; only tells Perl that the source code is in UTF-8.) If you do those two things, the regexp engine will use Unicode semantics, so \w and character classes will match accented letters, etc.
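The two steps above can be sketched as follows. The specific decoding options this post listed are not preserved here, so the use of Encode's decode and utf8::upgrade below is an assumption about what was meant, not a quote; the sample bytes and variable names are made up for illustration.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical input: the raw UTF-8 bytes for "café", as they might
# arrive from a file or socket that has no decoding layer applied.
my $bytes = "caf\xC3\xA9";

# Step 1: decode the bytes into a string of characters.
my $text = decode('UTF-8', $bytes);

# Step 2: make sure the string is stored internally as UTF-8.
# The utf8:: functions are always available; no "use utf8;" is needed.
utf8::upgrade($text);

# With both steps done, \w also matches the accented letter.
my $length         = length $text;
my $all_word_chars = $text =~ /^\w+$/ ? 1 : 0;

binmode STDOUT, ':encoding(UTF-8)';
print "length=$length all_word_chars=$all_word_chars\n";
```

Here decode already returns a string that is stored internally as UTF-8, so the utf8::upgrade call is a harmless no-op; it is kept to show the second step for strings that come from somewhere else.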