
Re: Ignoring specific html tags before parsing

by roboticus (Chancellor)
on Oct 07, 2013 at 03:05 UTC

in reply to Ignoring specific html tags before parsing


If you look at HTML::Parser, its docs include a couple of examples, and the second one is very close to what you want. Since HTML::TreeBuilder builds on top of HTML::Parser, you should be able to apply the same approach when you parse with it. Looking at the docs, it appears that the eg/hstrip example in the distribution can be adapted to do what you're attempting.
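To give a rough idea of the approach, here is a minimal sketch using plain HTML::Parser, loosely in the spirit of eg/hstrip but not a copy of it. The helper name strip_html, the sample markup, and the choice of which tags to ignore are all made up for illustration; the only HTML::Parser calls it relies on are new, ignore_tags, ignore_elements, parse and eof. Whether the same calls behave identically on an HTML::TreeBuilder object is something I have not verified.

```perl
use strict;
use warnings;
use HTML::Parser ();

# Hypothetical helper (not part of HTML::Parser): returns $html with the
# requested markup filtered out.
sub strip_html {
    my ($html, $ignore_tags, $ignore_elements) = @_;
    my $out = '';

    my $p = HTML::Parser->new(
        api_version => 3,
        # Copy the source text of every event that is not suppressed.
        default_h   => [ sub { $out .= shift }, 'text' ],
    );

    # Suppress only the start/end tags; the text between them is kept.
    $p->ignore_tags(@$ignore_tags);

    # Suppress the start/end tags and everything between them.
    $p->ignore_elements(@$ignore_elements);

    $p->parse($html);
    $p->eof;    # flush any buffered text
    return $out;
}

# Made-up sample input for illustration.
my $html = '<p>Hello <b>bold</b> world<script>alert(1)</script>!</p>';
print strip_html($html, ['b'], ['script']), "\n";
```

Here the b tags are dropped while their text survives, and the script element is removed along with its contents. Collecting into a string rather than printing directly just makes the result easier to hand on to a second parsing pass.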

Disclaimer: I've not done anything significant with HTML::Parser.


When your only tool is a hammer, all problems look like your thumb.
