Beefy Boxes and Bandwidth Generously Provided by pair Networks
Problems? Is your data what you think it is?

Re: pattern matching

by choroba (Canon)
on Oct 12, 2012 at 14:57 UTC ( #998715=note: print w/ replies, xml ) Need Help??

in reply to pattern matching

Captures with quantifiers only return the last match. I cannot find it documented, but try

perl -E 'say for "ab" =~ /(.)+/g'
لսႽ ᥲᥒ⚪⟊Ⴙᘓᖇ Ꮅᘓᖇ⎱ Ⴙᥲ𝇋ƙᘓᖇ

Comment on Re: pattern matching
Download Code
Re^2: pattern matching
by Athanasius (Abbot) on Oct 12, 2012 at 16:09 UTC

    How about this?

    #! perl use strict; use warnings; use YAPE::Regex::Explain; my $re = qr/(.)+/; print YAPE::Regex::Explain->new($re)->explain;


    The regular expression: (?-imsx:(.)+) matches as follows: NODE EXPLANATION ---------------------------------------------------------------------- (?-imsx: group, but do not capture (case-sensitive) (with ^ and $ matching normally) (with . not matching \n) (matching whitespace and # normally): ---------------------------------------------------------------------- ( group and capture to \1 (1 or more times (matching the most amount possible)): ---------------------------------------------------------------------- . any character except \n ---------------------------------------------------------------------- )+ end of \1 (NOTE: because you are using a quantifier on this capture, only the LAST repetition of the captured pattern will be stored in \1) ---------------------------------------------------------------------- ) end of grouping ----------------------------------------------------------------------

    Not strictly documentation, but certainly confirmation that “Captures with quantifiers only return the last match.”

    Update: Here is the official documentation:

    /(\d)(\d)/ # Match two digits, capturing them into $1 and $2 /(\d+)/ # Match one or more digits, capturing them all into $1 /(\d)+/ # Match a digit one or more times, capturing the last into + $1

    Note the difference between the second and third patterns. The second form is usually what you want. The third form does not create multiple variables for multiple digits. Parentheses are numbered when the pattern is compiled, not when it is matched.

    The Camel Book, 4th edition, pages 221–222.

    Athanasius <°(((><contra mundum

      Thanks, I believe you are right, but how do I get then all matches as individual elements into the array?

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://998715]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others chilling in the Monastery: (5)
As of 2015-04-02 10:06 GMT
Find Nodes?
    Voting Booth?

    Who makes your decisions?

    Results (63 votes), past polls