Beefy Boxes and Bandwidth Generously Provided by pair Networks
There's more than one way to do things
 
PerlMonks  

Re: Help With Online Table Scraper

by Anonymous Monk
on Mar 02, 2011 at 05:19 UTC ( [id://890907]=note: print w/replies, xml ) Need Help??


in reply to Help With Online Table Scraper

  1. wget -c "http://finance.yahoo.com/q/ks?s=MNDO+Key+Statistics" -O MNDO+Key+Statistics.html
  2. perl htmltreexpather.pl MNDO+Key+Statistics.html _tag table >MNDO+Key+Statistics.xpaths.txt
  3. Then plug in the xpaths you want into this example web::scraper using an xpath

Read perlintro and References quick reference for a quick refresher.

Read http://w3schools.com/xpath/default.asp for gentle introduction to xpath.

Use     cpan Web::Scraper     or     cpanp i Web::Scraper     to install Web::Scraper

Replies are listed 'Best First'.
Re^2: Help With Online Table Scraper
by Anonymous Monk on Mar 03, 2011 at 09:40 UTC
    Sinistral is right, use a documented API whenever available, scraping is a fragile PITA :) Come to think of it, Web::Scraper might be also be a bit of a PITA, but I've only studied the trivial examples, not the others

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://890907]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others learning in the Monastery: (6)
As of 2024-03-19 10:07 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found