Beefy Boxes and Bandwidth Generously Provided by pair Networks
Pathologically Eclectic Rubbish Lister

Re: Ignoring specific html tags before parsing

by roboticus (Chancellor)
on Oct 07, 2013 at 03:05 UTC ( #1057211=note: print w/replies, xml ) Need Help??

in reply to Ignoring specific html tags before parsing


If you look at HTML::Parser, it has a couple of examples. In fact, the second one is very close to what you're wanting. Since HTML::TreeBuilder builds on top of HTML parser, you should be able to tweak it to do what you want when you parse it. Looking at the docs, it appears that the eg/hstrip example in the distribution can be coerced into doing what you're attempting to do.

Disclaimer: I've not done anything significant with HTML::Parser.


When your only tool is a hammer, all problems look like your thumb.

  • Comment on Re: Ignoring specific html tags before parsing

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1057211]
Discipulus the pope is the last good politic here around..sigh
[Discipulus]: erix the right order of books is: The Three Musketeers, Twenty Years After, The Count of Moret; The Red Sphinx (not in the cycle but a must read!), The Vicomte de Bragelonne

How do I use this? | Other CB clients
Other Users?
Others meditating upon the Monastery: (10)
As of 2017-05-24 07:09 GMT
Find Nodes?
    Voting Booth?