http://www.perlmonks.org?node_id=56686


in reply to How to get HTML::Parser to return a line of parsed text

It goes something like this:

#!/usr/bin/perl -w use strict; use HTML::Parser; my $text; my $p = HTML::Parser->new(text_h => [ sub {$text .= shift}, 'dtext']); $p->parse_file('test.html'); print $text;

which, when used on a file like this:

<html> <head> <title>Test</title> </head> <body> <h1>Test Stuff</h1> <p>This is a test</p> <ul> <li>this</li> <li>is a</li> <li>list</li> </ul> </body> </html>

produces the following output:

Test Test Stuff This is a test this is a list

Does that help?

Update: But merlyn's solution is way cooler.

--
<http://www.dave.org.uk>

"Perl makes the fun jobs fun
and the boring jobs bearable" - me