http://www.perlmonks.org?node_id=11157922


in reply to Re: Module to extract text from HTML
in thread Module to extract text from HTML

Mojo::DOM is a parser which makes this trivial. However, I get the impression from the question that it's less about selecting particular parts of the page ('just extracting the p tags which is not quite good enough') and more about getting 'all' of the text.
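
For what it's worth, here is a rough sketch of the two approaches side by side. The sample HTML is made up for illustration, and the choice to strip script and style elements before collecting the text is my own assumption about what 'all' of the text should mean. I believe newer Mojolicious releases already skip those elements in all_text for HTML documents, in which case the explicit removal is redundant but harmless.

```perl
use strict;
use warnings;
use Mojo::DOM;

# Hypothetical sample page, invented for this example.
my $html = <<'HTML';
<html><head><title>Demo</title><style>p { color: red }</style></head>
<body><h1>Heading</h1><p>First <b>bold</b> paragraph.</p>
<script>var x = 1;</script><div>Loose text</div></body></html>
HTML

my $dom = Mojo::DOM->new($html);

# Selecting particular parts: the text of the <p> elements only.
my @paragraphs = $dom->find('p')->map('all_text')->each;

# 'All' of the text: drop elements whose content is not meant to be read
# (an assumption), then collect the text of everything that is left.
$dom->find('script, style')->each(sub { $_->remove });
my $all_text = $dom->all_text;

# Collapse runs of whitespace left over from the markup layout.
$all_text =~ s/\s+/ /g;
$all_text =~ s/^\s+|\s+$//g;

print "$_\n" for @paragraphs;
print "$all_text\n";
```

The first print only shows the paragraph, while the second also picks up the title, the heading and the text in the div, which is probably closer to what was asked for.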