http://www.perlmonks.org?node_id=500121


in reply to Re: html analysis tool via regex
in thread html analysis tool via regex

That you mention "database entries" makes me wonder if modules such as HTML::TableContentParser, HTML::EasyTable, or HTML::TableExtract might be helpful.

And while this wasn't exactly what you inquired about, I would also recommend HTML TIDY which can help clean up lots of messy things in your HTML.

HTH,

planetscape