http://www.perlmonks.org?node_id=951925


in reply to Re: Regular expression
in thread Regular expression

If the input is not well-formed XML but HTML, you can use parse_html instead of parse in order to internally convert the HTML to XHTML before processing.