Beefy Boxes and Bandwidth Generously Provided by pair Networks
Keep It Simple, Stupid
 
PerlMonks  

Re^2: HTML::Parser fun

by FreakyGreenLeaky (Sexton)
on Jun 04, 2008 at 14:08 UTC ( #690144=note: print w/ replies, xml ) Need Help??


in reply to Re: HTML::Parser fun
in thread HTML::Parser fun

thanks for the info: I seem to recall testing HTML::Treebuilder and finding it lagging behind HTML::Parser in terms of performance (HTML::TokeParser::Simple was the worst performer, but easiest to use).

Our problem is that that performance penalty really becomes a problem when we're processing hundreds of millions of files...

Hence the choice of HTML::Parser. Now that I've got a taste of it's performance benefits, I'm loath to let go.


Comment on Re^2: HTML::Parser fun

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://690144]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others imbibing at the Monastery: (8)
As of 2015-07-08 08:24 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (96 votes), past polls