http://www.perlmonks.org?node_id=781038


in reply to Re: Regex not working
in thread Regex not working

I'm trying to get individual values from the table and then convert to pdf... Would HTML::TreeBuilder be a good choice to fetch data?

I have minimal experience with it, mainly because each time I pick it up I find the interface cumbersome and unwieldy. It's also relatively slow, although I don't consider that an important point.

I find HTML::Parser much easier to use (although you have to invest some time in learning how to use it). If you install it via a package, do yourself a favour and track down the examples directory that is bundled with the distribution. You will probably find an example that you can adapt to the problem at hand.

It's a complex tool, but one that's worth mastering if you have to grovel around in HTML files.
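
For what it's worth, here is a minimal sketch of pulling cell values out of a table with HTML::Parser's version 3 handler API. The sample HTML and the collect_cells helper are made up for illustration (the actual table from this thread isn't shown), and it assumes a single, non-nested table.

```perl
use strict;
use warnings;
use HTML::Parser;

# Hypothetical sample input; substitute the real document.
my $html = <<'END_HTML';
<table>
  <tr><th>Item</th><th>Qty</th></tr>
  <tr><td>Widget</td><td>3</td></tr>
  <tr><td>Gadget &amp; Co</td><td>12</td></tr>
</table>
END_HTML

# collect_cells is an illustrative helper, not part of HTML::Parser.
# It returns a list of rows, each an array ref of cell strings.
sub collect_cells {
    my ($doc) = @_;
    my @rows;
    my $in_cell = 0;

    my $parser = HTML::Parser->new(
        api_version => 3,
        # Start tags: open a new row or a new (empty) cell.
        start_h => [ sub {
            my ($tag) = @_;
            if ($tag eq 'tr') {
                push @rows, [];
            }
            elsif ($tag eq 'td' || $tag eq 'th') {
                push @rows, [] unless @rows;
                push @{ $rows[-1] }, '';
                $in_cell = 1;
            }
        }, 'tagname' ],
        # Text: append to the current cell; 'dtext' decodes entities.
        text_h => [ sub {
            my ($text) = @_;
            $rows[-1][-1] .= $text if $in_cell;
        }, 'dtext' ],
        # End tags: leave the cell.
        end_h => [ sub {
            my ($tag) = @_;
            $in_cell = 0 if $tag eq 'td' || $tag eq 'th';
        }, 'tagname' ],
    );

    $parser->parse($doc);
    $parser->eof;
    return @rows;
}

my @rows = collect_cells($html);
print join(' | ', @$_), "\n" for @rows;
```

Text may arrive in more than one chunk, which is why the text handler appends rather than assigns. Once the values are in @rows, handing them to whatever generates the PDF is a separate step.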

• another intruder with the mooring in the heart of the Perl

Re^3: Regex not working
by prantikd (Novice) on Jul 18, 2009 at 12:20 UTC
    Hi Raghu,

    Since you are evaluating different Perl modules for parsing HTML files, take a look at HTML::TokeParser. It is an alternative interface to HTML::Parser. I have used it and found it quite helpful.
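
    As a rough sketch of what that looks like for a table, assuming a simple non-nested one: the sample HTML and the table_rows helper below are invented for illustration, while get_tag and get_trimmed_text are the module's own methods.

```perl
use strict;
use warnings;
use HTML::TokeParser;

# Hypothetical sample input; substitute the real document.
my $html = <<'END_HTML';
<table>
  <tr><th>Item</th><th>Qty</th></tr>
  <tr><td>Widget</td><td>3</td></tr>
  <tr><td>Gadget &amp; Co</td><td>12</td></tr>
</table>
END_HTML

# table_rows is an illustrative helper, not part of HTML::TokeParser.
# It returns a list of rows, each an array ref of cell strings.
sub table_rows {
    my ($doc) = @_;
    my $p = HTML::TokeParser->new(\$doc)
        or die "Could not set up the parser";
    my @rows;

    # Skip ahead to the next <tr>, <td> or <th> start tag.
    while (my $tag = $p->get_tag('tr', 'td', 'th')) {
        my $name = $tag->[0];
        if ($name eq 'tr') {
            push @rows, [];
            next;
        }
        push @rows, [] unless @rows;
        # Read the text up to the matching end tag, whitespace trimmed.
        push @{ $rows[-1] }, $p->get_trimmed_text("/$name");
    }
    return @rows;
}

my @rows = table_rows($html);
print join(' | ', @$_), "\n" for @rows;
```

    The appeal is that you pull tokens in an ordinary loop instead of writing callbacks.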

    - Prantik