PerlMonks  

Re: To find Cyrillic characters - unicode

by Zaxo (Archbishop)
on Aug 03, 2007 at 04:53 UTC ( [id://630448] )


in reply to To find Cyrillic characters - unicode

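The regex Unicode block property '\p{InCyrillic}' will get you what you want. Mind the lowercase 'p': the uppercase form '\P{InCyrillic}' is the negation and matches everything outside the Cyrillic block. You may need to open the file in ':utf8' mode so that perl decodes the bytes to characters before the regex sees them.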
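As a rough sketch of the above, something like the following should pull runs of Cyrillic-block characters out of a string or a file. The sub names cyrillic_runs and cyrillic_in_file are made up for illustration, and the file is opened with the stricter ':encoding(UTF-8)' layer instead of plain ':utf8', which is my substitution rather than a requirement.

```perl
use strict;
use warnings;
use utf8;    # the sample string below is literal UTF-8 in this source

# Return every run of Cyrillic-block characters (U+0400..U+04FF) in a string.
# (Hypothetical helper name, for illustration only.)
sub cyrillic_runs {
    my ($text) = @_;
    my @runs = $text =~ /(\p{InCyrillic}+)/g;
    return @runs;
}

# Scan a file line by line, decoding it as UTF-8 on the way in.
# Assumption: ':encoding(UTF-8)' is used here in place of ':utf8'.
sub cyrillic_in_file {
    my ($path) = @_;
    open my $fh, '<:encoding(UTF-8)', $path
        or die "Cannot open $path: $!";
    my @found;
    while (my $line = <$fh>) {
        push @found, cyrillic_runs($line);
    }
    close $fh;
    return @found;
}

binmode STDOUT, ':encoding(UTF-8)';

# In-memory demo: prints "мир" and "Привет", one per line.
print "$_\n" for cyrillic_runs("Hello, мир! Привет, world");

# Optional: pass a file name on the command line to scan a file instead.
if (@ARGV) {
    print "$_\n" for cyrillic_in_file($ARGV[0]);
}
```

If you care about the script rather than the block (the Cyrillic Supplement, for instance, lives in a separate block), '\p{Cyrillic}' may suit you better than '\p{InCyrillic}'.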

Isolating your match to particular XML elements will require one of the XML modules. The parser ought to hand you the text as UTF-8 by default, but old perls may be idiosyncratic about that.
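A minimal sketch of the element-restricted match, assuming XML::LibXML as the module (my pick; it is not in core and any of the XML modules would do). The sub name cyrillic_elements, the element name 'title', and the sample document are all invented for the example.

```perl
use strict;
use warnings;
use utf8;                   # the sample document below is literal UTF-8
use Encode qw(encode);
use XML::LibXML;            # assumption: installed from CPAN

# Return the text of each node selected by $xpath that contains at least
# one Cyrillic-block character. (Hypothetical helper, for illustration.)
sub cyrillic_elements {
    my ($xml_bytes, $xpath) = @_;
    my $doc = XML::LibXML->load_xml(string => $xml_bytes);
    my @hits;
    for my $node ($doc->findnodes($xpath)) {
        my $text = $node->textContent;    # already decoded to characters
        push @hits, $text if $text =~ /\p{InCyrillic}/;
    }
    return @hits;
}

# Invented sample document; the parser is given UTF-8 encoded bytes.
my $xml = encode('UTF-8',
    '<doc><title>Война и мир</title><title>War and Peace</title>'
  . '<note>мир</note></doc>');

binmode STDOUT, ':encoding(UTF-8)';

# Only <title> elements are inspected, so the <note> is ignored.
print "$_\n" for cyrillic_elements($xml, '//title');
```

For a file on disk, load_xml(location => $path) lets the parser take the encoding from the XML declaration, so no ':utf8' layer is needed on that route.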

After Compline,
Zaxo

Re^2: To find Cyrillic characters - unicode
by Anonymous Monk on Aug 03, 2007 at 06:53 UTC
    Many Thanks!!!!!!!!!!
