Samn has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
Re: regex question
by Zaxo (Archbishop) on Aug 01, 2002 at 03:32 UTC

    use HTML::Parser;

    It handles maniacal markup you'll never think of in your home-rolled regexen.

    Update: ++mkmcconn suggested I add HTML::TokeParser to the recommendation, and I agree (I knew I was forgetting a good one).

    After Compline,
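
For readers who want to see what the parser route might look like, here is a minimal sketch using HTML::TokeParser. It assumes the goal is the one krusty's substitution suggests (turn each IMG tag into a bracketed link to its src). The helper name `img_to_links` and the sample input are made up for illustration and are not from the thread.

```perl
use strict;
use warnings;
use HTML::TokeParser;                      # part of the HTML-Parser distribution on CPAN
use HTML::Entities qw(encode_entities);    # same distribution

# Hypothetical helper: rewrite every <img> start tag as
# [image:<a href="URL">URL</a>] and copy everything else through unchanged.
sub img_to_links {
    my ($html) = @_;
    my $parser = HTML::TokeParser->new(\$html)
        or die "Cannot create parser";
    my $out = '';
    while (my $token = $parser->get_token) {
        my $type = $token->[0];
        if ($type eq 'S' && $token->[1] eq 'img') {
            # Start tags carry a hash of attributes (names lowercased,
            # values entity-decoded), so re-escape before printing.
            my $src = $token->[2]{src};
            if (defined $src) {
                my $safe = encode_entities($src);
                $out .= qq{[image:<a href="$safe">$safe</a>]};
            }
            else {
                $out .= $token->[4];    # no src attribute: keep the tag as written
            }
        }
        elsif ($type eq 'S' || $type eq 'E' || $type eq 'PI') {
            $out .= $token->[-1];       # original source text is the last element
        }
        else {
            $out .= $token->[1];        # text, comment, declaration: original text
        }
    }
    return $out;
}

print img_to_links(q{<p>Look: <IMG alt="a cat" src = "cat.gif"></p>}), "\n";
```

For that sample this should print `<p>Look: [image:<a href="cat.gif">cat.gif</a>]</p>`. Attribute order, quoting, tag case, and whitespace around "=" are all left to the parser, which is the point of the recommendation.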

Re: regex question
by krusty (Hermit) on Aug 01, 2002 at 03:15 UTC
    $body =~ s{<img src=(.*?)>}{[image:<a href=$1>$1</a>]}gi;
    Sounds like this might be what you're looking for.
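
A quick way to see what this substitution does, and where it stops working, is to run it on a few inputs. The sketch below wraps it in a hypothetical `link_images` sub and uses `s{}{}` delimiters so the "/" in `</a>` needs no escaping. The sample strings are invented for illustration.

```perl
use strict;
use warnings;

# Hypothetical wrapper around the substitution from the post above.
sub link_images {
    my ($body) = @_;
    $body =~ s{<img src=(.*?)>}{[image:<a href=$1>$1</a>]}gi;
    return $body;
}

# Bare src attribute: works as intended.
print link_images('<img src=cat.gif>'), "\n";
# prints: [image:<a href=cat.gif>cat.gif</a>]

# Extra attributes: everything up to ">" is captured, quotes and alt included.
print link_images('<img src="cat.gif" alt="a cat">'), "\n";
# prints: [image:<a href="cat.gif" alt="a cat">"cat.gif" alt="a cat"</a>]

# Whitespace around "=": the literal "<img src=" no longer matches.
print link_images('<img src = cat.gif>'), "\n";
# prints: <img src = cat.gif>
```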

Re: regex question
by Abigail-II (Bishop) on Aug 01, 2002 at 09:44 UTC
    You already identified one of the problems (and solutions have been suggested for that), but let me point out that your regex won't work either if there's whitespace between "src" and "=".

    BTW, HTML doesn't have alt tags. HTML has alt attributes - which have been mandatory for IMG tags for quite some time.
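
If a regex is still wanted, one possible pattern that tolerates both points (whitespace around "=" and other attributes such as alt) is sketched below. The sub name `link_images_tolerant` is hypothetical, the pattern needs Perl 5.10 or later for the branch-reset group, and it is an illustration only: unusual markup, such as the text "src=" inside another attribute's value, can still fool it, which is why the parser modules were recommended.

```perl
use strict;
use warnings;

# Hypothetical helper; an illustration, not a full HTML parser.
sub link_images_tolerant {
    my ($body) = @_;
    $body =~ s{
        <img \b             # start of an IMG tag
        [^>]*?              # any earlier attributes, such as alt
        \s src \s* = \s*    # src, with optional whitespace around the equals sign
        (?|                 # branch reset: each alternative fills capture 1
            "([^"]*)"       #   double-quoted value
          | '([^']*)'       #   single-quoted value
          | ([^\s>]+)       #   unquoted value
        )
        [^>]* >             # the rest of the tag
    }{[image:<a href="$1">$1</a>]}gix;
    return $body;
}

print link_images_tolerant('<IMG alt="a cat" src = "cat.gif">'), "\n";
# prints: [image:<a href="cat.gif">cat.gif</a>]
print link_images_tolerant('<img src=cat.gif>'), "\n";
# prints: [image:<a href="cat.gif">cat.gif</a>]
```

Unlike the shorter substitution, this version always quotes the href value in its output.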


by Samn (Monk) on Aug 01, 2002 at 02:50 UTC