Re: Dump Text from HTML

by OeufMayo (Curate)
on Jul 18, 2001 at 14:07 UTC

in reply to Dump Text from HTML

And if you want something more reliable and a bit less brutal:
(requires HTML::Parser version 3 or higher)

perl -MHTML::Parser -e '$p=HTML::Parser->new(text_h=>[sub{print shift},"dtext"]);for(@ARGV){$p->parse_file($_)}' file.html
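
For anyone who finds the one-liner hard to read, here is roughly the same idea spelled out as a script. Treat it as a sketch rather than the canonical way to do it: the sub name `html_to_text` and the slurp-the-file loop are my own choices for illustration, not part of the one-liner. It registers a single text handler that receives each text segment with entities decoded ("dtext") and collects the result.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use HTML::Parser 3;    # the handler-style API needs version 3 or later

# html_to_text is a hypothetical helper name: it returns the text content
# of an HTML string, with entities such as &amp; decoded.
sub html_to_text {
    my ($html) = @_;
    my $text   = '';
    my $parser = HTML::Parser->new(
        api_version => 3,
        text_h      => [ sub { $text .= shift }, 'dtext' ],
    );
    $parser->parse($html);
    $parser->eof;      # flush any text still buffered by the parser
    return $text;
}

for my $file (@ARGV) {
    open my $fh, '<', $file
        or do { warn "Cannot open $file: $!\n"; next };
    my $html = do { local $/; <$fh> };    # slurp the whole file
    close $fh;
    print html_to_text($html);
}
```

Run it as `perl dumptext.pl file.html` (the script name is arbitrary). Like the one-liner, it prints every text segment, so the contents of script and style elements come out too.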
Re: Re: Dump Text from HTML
on Jul 26, 2001 at 14:05 UTC
    Hello, and thanks for your reply. I'm just a Perl novice, carefully reading the Camel book, and I wanted to trade elegance for portability. I'm now trying to optimize my code and make it work well enough to translate the HTML subset I most often find in the pages I look at. I'll post my results soon. I'm worried about giving this code to someone who doesn't have the HTML::Parser module, so I use regexps. Thanks again, and bye. SiG
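
    One possible way to get both portability and reliability, offered only as a sketch: try to load HTML::Parser at run time and fall back to a regexp when it is missing. The sub name `strip_html` is hypothetical, and the fallback regexp is a deliberately crude stand-in, not the code from the original post.

```perl
use strict;
use warnings;

# True only if HTML::Parser 3 or later can be loaded; eval traps the
# error that require or the version check raises otherwise.
my $have_parser = eval { require HTML::Parser; HTML::Parser->VERSION(3); 1 };

# strip_html is a hypothetical name used for illustration only.
sub strip_html {
    my ($html) = @_;
    if ($have_parser) {
        my $text = '';
        my $p    = HTML::Parser->new(
            api_version => 3,
            text_h      => [ sub { $text .= shift }, 'dtext' ],
        );
        $p->parse($html);
        $p->eof;
        return $text;
    }
    # Crude fallback (an assumption, not the poster's regexp): remove
    # anything that looks like a tag. Entities are left undecoded, and
    # comments or attribute values containing ">" will confuse it.
    (my $text = $html) =~ s/<[^>]*>//g;
    return $text;
}

print strip_html('<b>bold</b> text'), "\n";
```

    The two branches agree on simple markup, but only the parser branch decodes entities, so the output can differ from machine to machine.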

