Beefy Boxes and Bandwidth Generously Provided by pair Networks
Welcome to the Monastery

Alternation in Effective Perl Programming Example

by Carl-Joseph (Scribe)
on Dec 07, 2000 at 08:16 UTC ( #45438=perlquestion: print w/replies, xml ) Need Help??
Carl-Joseph has asked for the wisdom of the Perl Monks concerning the following question:

On page 72, the book Effective Perl Programming shows some code to remove C-style comments from a string. I've reproduced the code below.

local $_; $_ = " This_Is_Code = 0; /* This is a comment */ /* * Can it handle a mult-line comment, i.e. * one that goes on for more that one * line? Yes, it can. */ This_Is_Code++; "; for (split m!("(:?\\\W|.)*?"|/\*|\*/)!) { if ($in_comment) { $in_comment = 0 if $_ eq "*/" } else { if ( $_ eq "/*" ) { $in_comment = 1; print " "; } else { print; } } }

I have a question about the regular expression that's used in the call to split(). The expression (:?\\\W|.) seems to say match either a non-word character, or any character. Why have both patterns in the alternation when the dot includes non-word characters?



Replies are listed 'Best First'.
Re: Alternation in Effective Perl Programming Example
by chromatic (Archbishop) on Dec 07, 2000 at 09:03 UTC
    It's a little more than that. Should it be (?:\\\W|.) instead?

    If so, as written, it would be interpreted as non-capturing parenthesis, matching either a backslash and a nonword character or a single character.

    Otherwise, it's (an optional colon followed by a backslash and a non-word character) or (a single character).

      Appears to be (looking at Effective perl) a typo, as the chapter leans heavily on (?: just before the example. You should mention it to merlyn as its not in the errata (neither is the misspelled 'Eart' on page 64 ;-)


        And, to the original question: I think the idea is to 'inch along'
        for (split m!("(?:\\\W|.)*?"|/\*|\*/)!) {
        will return the pieces ending w/ an escaped non-word char, a single char or the comment begin/end markers. Hmm, an escaped non-word char or a single char in quotes? No, as many \\\w|. as found between quotes. I guess so:
        biff = "not a comment /*"; /* marker for not a comment */
        won't start $in_comment too early.


Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: perlquestion [id://45438]
Approved by root
and all is quiet...

How do I use this? | Other CB clients
Other Users?
Others lurking in the Monastery: (2)
As of 2017-04-23 06:57 GMT
Find Nodes?
    Voting Booth?
    I'm a fool:

    Results (430 votes). Check out past polls.