Beefy Boxes and Bandwidth Generously Provided by pair Networks
Think about Loose Coupling

Re^2: Regex not working

by grinder (Bishop)
on Jul 17, 2009 at 12:03 UTC ( #781038=note: print w/replies, xml ) Need Help??

in reply to Re: Regex not working
in thread Regex not working

I'm trying to get individual values from the table and then convert to pdf... Would HTML::TreeBuilder be a good choice to fetch data?

I've minimal experience with it, mainly because each time I pick it up, I've found the interface cumbersome, and unwieldy to use. And it's pretty slow, relatively speaking, although I don't consider that to be an important point.

I find HTML::Parser much easier to use (although you have to invest some time in learning how to use it). If you install it via a package, do yourself a favour and track down the examples directory that is bundled with the distribution. You will probably find an example that you can adapt to the problem at hand.

It's a complex tool that's worthwhile mastering if you have to grovel around in HTML files.

• another intruder with the mooring in the heart of the Perl

Replies are listed 'Best First'.
Re^3: Regex not working
by prantikd (Novice) on Jul 18, 2009 at 12:20 UTC
    Hi Raghu,

    Since, you are evaluating different perl modules to parse HTML files, you can take a look at HTML::TokeParser. It is an alternative HTML::Parser interface. I have used it and found it pretty helpful.

    - Prantik

Log In?

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://781038]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others surveying the Monastery: (3)
As of 2022-05-28 07:28 GMT
Find Nodes?
    Voting Booth?
    Do you prefer to work remotely?

    Results (98 votes). Check out past polls.