Beefy Boxes and Bandwidth Generously Provided by pair Networks Bob
There's more than one way to do things
 
PerlMonks  

Re: HTML TokeParser - help with using get_text, get_trimmed_text

by steves (Curate)
on Nov 10, 2004 at 00:30 UTC ( #406572=note: print w/ replies, xml ) Need Help??


in reply to HTML TokeParser - help with using get_text, get_trimmed_text

HTML::Parser, while harder to use, will give you control over what tags are parsed and how that parsing is handled per tag if needed.

Since HTML::TokeParser is breaking all tags down for you, you have to examine every token it gives you and put those "back together" that you want to output. Each token has enough information to reconstruct the data for output.


Comment on Re: HTML TokeParser - help with using get_text, get_trimmed_text

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://406572]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others scrutinizing the Monastery: (5)
As of 2014-04-20 07:02 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    April first is:







    Results (485 votes), past polls