Beefy Boxes and Bandwidth Generously Provided by pair Networks
There's more than one way to do things
 
PerlMonks  

Re^2: How do I Extract Data From Amazon.com's Website?

by Joost (Canon)
on Aug 15, 2007 at 18:50 UTC ( #632829=note: print w/replies, xml ) Need Help??


in reply to Re: How do I Extract Data From Amazon.com's Website?
in thread How do I Extract Data From Amazon.com's Website?

If you're dealing with a web interface (i.e. HTML with forms, links, etc instead of "pure" HTTP) you're probably better off using WWW::Mechanize - it's an LWP subclass with lots of specialized methods to search and "click" through web pages/forms.

There are even a few "clones" of WWW::Mechanize that use popular browsers at the back-end so you can deal with javascript and other client-side objects not normally supported by WWW::Mechanize.

  • Comment on Re^2: How do I Extract Data From Amazon.com's Website?

Replies are listed 'Best First'.
Re^3: How do I Extract Data From Amazon.com's Website?
by erroneousBollock (Curate) on Aug 16, 2007 at 06:58 UTC
    Recently, some folks figured out a way to use Mozilla in a mechanize driver without the need for a (visible) X-server.

    Sounds promising.

    -David

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://632829]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others surveying the Monastery: (11)
As of 2021-05-07 12:43 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    Perl 7 will be out ...





    Results (91 votes). Check out past polls.

    Notices?