I've certainly never tested it with conditionals.
I think it's the conditional indeed, because it works smoothly when I re-write it to not use a conditional:
%
^
\s*
( # <1>
# Single tags like <foo/>
<
\s*
[a-zA-Z:]+
(?:
\s*[a-zA-Z:]*
\s* = \s*
(?:'[^']*'|"[^"]*")
)*
\s*
/\s*
>
|
# Tags in pairs like <foo>content</foo>
<
\s*
([a-zA-Z:]+) # <2/>
(?:
\s*[a-zA-Z:]*
\s* = \s*
(?:'[^']*'|"[^"]*")
)*
\s*
>
(?:[^<>]* | (?1))*
<\s*/\s*\2\s*>
) # </1>
\s*
$
%x
<foo><bar></bar></foo> # Match
<foo><bar></foo></bar> # No match
<foo><bar/></foo> # Match
<foo><bar></foo> # No match
<foo bar=baz/> # No match
<foo bar="baz"> # No match
<foo bar="baz"/> # Match
< fooo / > # Match
<foo/>foo # No match
foo<foo/> # No match
<foo>foo</foo> # Match
<foo><bar/>foo</foo> # Match
#<a><b><c></c></b></a> # No match (WRONG!!)
Now, there's still the three-level-deep problem...
U28geW91IGNhbiBhbGwgcm90MTMgY
W5kIHBhY2soKS4gQnV0IGRvIHlvdS
ByZWNvZ25pc2UgQmFzZTY0IHdoZW4
geW91IHNlZSBpdD8gIC0tIEp1ZXJk
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
|
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.
|
|