Beefy Boxes and Bandwidth Generously Provided by pair Networks
Syntactic Confectionery Delight
 
PerlMonks  

Re: HTML TokeParser - help with using get_text, get_trimmed_text

by steves (Curate)
on Nov 10, 2004 at 00:30 UTC ( #406572=note: print w/ replies, xml ) Need Help??


in reply to HTML TokeParser - help with using get_text, get_trimmed_text

HTML::Parser, while harder to use, will give you control over what tags are parsed and how that parsing is handled per tag if needed.

Since HTML::TokeParser is breaking all tags down for you, you have to examine every token it gives you and put those "back together" that you want to output. Each token has enough information to reconstruct the data for output.


Comment on Re: HTML TokeParser - help with using get_text, get_trimmed_text

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://406572]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others imbibing at the Monastery: (7)
As of 2015-07-08 01:04 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (93 votes), past polls