Beefy Boxes and Bandwidth Generously Provided by pair Networks
good chemistry is complicated,
and a little bit messy -LW
 
PerlMonks  

Re: Ignoring specific html tags before parsing

by roboticus (Canon)
on Oct 07, 2013 at 03:05 UTC ( #1057211=note: print w/ replies, xml ) Need Help??


in reply to Ignoring specific html tags before parsing

ganeshPerlStarter:

If you look at HTML::Parser, it has a couple of examples. In fact, the second one is very close to what you're wanting. Since HTML::TreeBuilder builds on top of HTML parser, you should be able to tweak it to do what you want when you parse it. Looking at the docs, it appears that the eg/hstrip example in the distribution can be coerced into doing what you're attempting to do.

Disclaimer: I've not done anything significant with HTML::Parser.

...roboticus

When your only tool is a hammer, all problems look like your thumb.


Comment on Re: Ignoring specific html tags before parsing

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1057211]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others rifling through the Monastery: (6)
As of 2014-10-31 06:15 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    For retirement, I am banking on:










    Results (215 votes), past polls