In reply to: Looking for a XPATH-like tool for HTML documents
Use XML::LibXML in recover mode, as described in the XML::LibXML::Parser documentation. It can deal with anything from horribly malformed pseudo-HTML to valid HTML with a DTD, and the parsed document can then be queried with XPath through findnodes.
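Something along these lines should work as a starting point. It is only a rough sketch: the sample markup, the variable names, and the XPath expression are made up for illustration, and I'm assuming a reasonably recent XML::LibXML (one that has load_html). Setting recover to 2 asks the parser to recover from errors silently.

```perl
use strict;
use warnings;
use XML::LibXML;

# Deliberately sloppy markup (invented for this example):
# unclosed <p> elements, an unclosed <a>, no closing </html>.
my $html = <<'HTML';
<html><body>
<p class="intro">First paragraph
<p>Second paragraph with links:
<a href="/one">one</a> <a href="/two">two
</body>
HTML

# recover => 2 turns on recover mode and keeps the parser quiet
# about the problems it repairs along the way.
my $doc = XML::LibXML->load_html(
    string  => $html,
    recover => 2,
);

# Plain XPath from here on: collect the href of every link.
my @hrefs = map { $_->getAttribute('href') } $doc->findnodes('//a[@href]');

print "$_\n" for @hrefs;
```

For a file or a URL, load_html takes location => ... instead of string => ..., with the same recover setting.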