Beefy Boxes and Bandwidth Generously Provided by pair Networks
XP is just a number

Re^2: Web scraping toolkit?

by mzedeler (Pilgrim)
on Jan 27, 2012 at 08:44 UTC ( #950285=note: print w/replies, xml ) Need Help??

in reply to Re: Web scraping toolkit?
in thread Web scraping toolkit?

I think that App::scrape may turn out to be insufficient, not covering some edge cases that needs handling. But again - thats my general worry, not having tried any of the scraping modules yet (the same goes for Web::Scraper and Scrappy).

WWW::Mechanize::Firefox looks very promising, and implementing the few extra features that Scrapie has (logging and such) shouldn't be a problem. The real drawback lies in having to rely on firefox (or some similar component) in development and production.

I'll go back to the drawing board and see what to do. Thanks for the pointers.

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://950285]
and all is quiet...

How do I use this? | Other CB clients
Other Users?
Others having an uproarious good time at the Monastery: (5)
As of 2018-01-20 14:05 GMT
Find Nodes?
    Voting Booth?
    How did you see in the new year?

    Results (226 votes). Check out past polls.