PerlMonks  

Re: Ignoring specific html tags before parsing

by roboticus (Chancellor)
on Oct 07, 2013 at 03:05 UTC (#1057211)


in reply to Ignoring specific html tags before parsing

ganeshPerlStarter:

If you look at the HTML::Parser documentation, it has a couple of examples, and the second one is very close to what you want. Since HTML::TreeBuilder is built on top of HTML::Parser, you should be able to tweak the parse step to do what you need. Looking at the docs, it appears that the eg/hstrip example in the distribution can be adapted to do what you're attempting.
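Something along these lines might be a starting point. It is only a rough sketch in the spirit of eg/hstrip, not a copy of it: the tag lists and the strip_html name are made up for illustration, and it relies on HTML::Parser's ignore_tags (drop the tags, keep the text inside) and ignore_elements (drop the tags and everything inside them).

```perl
use strict;
use warnings;
use HTML::Parser ();

# Hypothetical lists for illustration; put your own tags here.
my @ignore_tags     = qw(b i font);      # drop the tags, keep their text
my @ignore_elements = qw(script style);  # drop the tags and their content

# strip_html is a made-up helper name, not part of HTML::Parser.
sub strip_html {
    my ($html) = @_;
    my $out = '';

    my $p = HTML::Parser->new(
        api_version => 3,
        # Anything the parser does not ignore is copied through verbatim.
        default_h   => [ sub { $out .= $_[0] }, 'text' ],
    );
    $p->ignore_tags(@ignore_tags);
    $p->ignore_elements(@ignore_elements);

    $p->parse($html);
    $p->eof;
    return $out;
}

my $cleaned = strip_html('<p>Hello <b>bold</b> world<script>alert(1)</script></p>');
print "$cleaned\n";
```

The cleaned string could then be handed to HTML::TreeBuilder (for example via new_from_content) if you still want a tree to walk. I haven't checked whether calling these methods directly on a TreeBuilder object behaves the same way, so treat that part as an assumption.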

Disclaimer: I've not done anything significant with HTML::Parser.

...roboticus

When your only tool is a hammer, all problems look like your thumb.
