Re^2: Regex not working

by grinder (Bishop)
on Jul 17, 2009 at 12:03 UTC

in reply to Re: Regex not working
in thread Regex not working

I'm trying to get individual values from the table and then convert to pdf... Would HTML::TreeBuilder be a good choice to fetch data?

I have minimal experience with it, mainly because each time I pick it up I find the interface cumbersome and unwieldy. It's also fairly slow, relatively speaking, although I don't consider that an important point.

I find HTML::Parser much easier to use (although you have to invest some time in learning how to use it). If you install it via a package, do yourself a favour and track down the examples directory that is bundled with the distribution. You will probably find an example that you can adapt to the problem at hand.

It's a complex tool that's worthwhile mastering if you have to grovel around in HTML files.
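
To give a feel for the event-driven style of HTML::Parser, here is a minimal sketch that collects the text of each table cell, row by row. The sample HTML and the variable names (@rows, $in_cell) are made up for illustration, and a real page would likely need more care (nested tables, missing end tags), so treat it as a starting point rather than a finished solution.

```perl
use strict;
use warnings;
use HTML::Parser;

# Hypothetical sample input; a real script would read the HTML from a file.
my $html = <<'HTML';
<table>
  <tr><td>Name</td><td>Qty</td></tr>
  <tr><td>Widget</td><td> 3 </td></tr>
</table>
HTML

my @rows;          # one array ref of cell strings per <tr>
my $in_cell = 0;   # true while we are inside a <td> or <th>

my $parser = HTML::Parser->new(
    api_version => 3,
    # Start tags: open a new row on <tr>, a new cell on <td>/<th>.
    start_h => [ sub {
        my ($tag) = @_;
        push @rows, [] if $tag eq 'tr';
        if ($tag eq 'td' || $tag eq 'th') {
            push @rows, [] unless @rows;   # tolerate a cell with no <tr>
            push @{ $rows[-1] }, '';
            $in_cell = 1;
        }
    }, 'tagname' ],
    # Text: append decoded text to the current cell only.
    text_h => [ sub {
        my ($text) = @_;
        $rows[-1][-1] .= $text if $in_cell;
    }, 'dtext' ],
    # End tags: leave the cell on </td> or </th>.
    end_h => [ sub {
        my ($tag) = @_;
        $in_cell = 0 if $tag eq 'td' || $tag eq 'th';
    }, 'tagname' ],
);

$parser->parse($html);
$parser->eof;

# Trim surrounding whitespace and print one tab-separated line per row.
for my $row (@rows) {
    s/^\s+|\s+$//g for @$row;
    print join("\t", @$row), "\n";
}
```

Once the values are sitting in @rows, they can be handed to whatever is producing the PDF.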

• another intruder with the mooring in the heart of the Perl

Re^3: Regex not working
by prantikd (Novice) on Jul 18, 2009 at 12:20 UTC
    Hi Raghu,

    Since you are evaluating different Perl modules for parsing HTML files, take a look at HTML::TokeParser. It is an alternative interface to HTML::Parser. I have used it and found it pretty helpful.
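
A rough sketch of what fetching table values with HTML::TokeParser can look like is below. The HTML string and the @cells variable are invented for the example; with real pages you would probably pass a filename or filehandle instead of a string reference.

```perl
use strict;
use warnings;
use HTML::TokeParser;

# Hypothetical sample input, passed as a reference to a scalar.
my $html = '<table><tr><td>Widget</td><td> 3 </td></tr></table>';

my $p = HTML::TokeParser->new(\$html)
    or die "Could not create parser";

my @cells;
# Skip ahead to each <td>, then grab the text up to the matching </td>.
while ($p->get_tag('td')) {
    push @cells, $p->get_trimmed_text('/td');
}

print "$_\n" for @cells;
```

Here you pull tokens out of the document in a loop instead of registering callbacks, which tends to be easier to follow for simple extraction jobs.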

    - Prantik
