Beefy Boxes and Bandwidth Generously Provided by pair Networks
We don't bite newbies here... much
 
PerlMonks  

Re: About \d \w and \s

by Corion (Pope)
on Oct 18, 2009 at 16:14 UTC ( #801880=note: print w/replies, xml ) Need Help??


in reply to About \d \w and \s

Personally, I've avoided relying on unicode/charset semantics with regular expressions. Most of the input I deal with is either Latin-1 or some other "near IBM-ASCII" single byte encoding, and so is my source code. I've made my regular expressions lenient in the sense that I use dots where I expect umlauts.

Of course, if I were more strict about the encodings of my input data, or Perl were more smart about guessing the encoding of my input data (which is hard without carrying a dictionary of likenesses), I could write my source code and my regular expressions in unicode, and then it would be cool if \w would use the unicode semantics.

I have no opinion on \d, as German has only 0-9 as digits anyway, and my input data also.

Replies are listed 'Best First'.
Re^2: About \d \w and \s
by demerphq (Chancellor) on Oct 18, 2009 at 16:19 UTC

    I believe that /u would provide sane matching for German or other latin-1 languages as it would make perl match according to the unicode rules even when the string/pattern weren't themselves unicode.

    ---
    $world=~s/war/peace/g

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://801880]
help
Chatterbox?
[ambrus]: Ok, the docs is somewhat unclear. It does say that when an object is garbage collected, it will get cleaned up, and eventually can no longer get messages. It's not clear how long this takes, eg. I think it's kept alive until its queued events are handled
[ambrus]: in the loop, and I'm not sure if that's ok for AnyEvent. Also, it's not clear if a Timer or File object you free really is garbage collected, i.e. that Prima doesn't keep some references to it, but I hope so.
[Corion]: choroba: No, I don't remember that story, but yes, it matches my experience ;))
[ambrus]: Hopefull the object isn't kept alive, the events are processed immediately, but you'd have to read a lot of source code to be sure about that.
[Corion]: ambrus: I think both of AnyEvent and Prima are pretty tight in their memory management because they both are cooperative multitasking and (I think) both use the Perl memory management for managing things
[Corion]: ambrus: And for Windows, I don't think that Prima knows if there still are messages queued for an object (in the Windows message loop). Finding that out would take lots of effort for little gain
[ambrus]: And even if this works, I'm still not sure you can't get double timeouts from a Timer.
[ambrus]: Corion: well Prima::Object says something like that the cleanup method will send an onDestory message and that you can't get more messages after cleanup, or something.
[Corion]: ambrus: Yeah - I don't think the deep source dive will be necessary if things are implemented as simple as they could be :)) And hopefully I won't need (more) timely object destruction. I can update the screen at 60Hz and hopefully even do HTTP ...
[Corion]: ... transfers in the background. Now that I think about it, this maybe even means that I can run the OpenGL filters on Youtube input :)

How do I use this? | Other CB clients
Other Users?
Others studying the Monastery: (8)
As of 2016-12-09 10:25 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    On a regular basis, I'm most likely to spy upon:













    Results (150 votes). Check out past polls.