PerlMonks  

Re: Help With Online Table Scraper

by Anonymous Monk
on Mar 02, 2011 at 05:19 UTC (#890907)


in reply to Help With Online Table Scraper

  1. wget -c "http://finance.yahoo.com/q/ks?s=MNDO+Key+Statistics" -O MNDO+Key+Statistics.html
  2. perl htmltreexpather.pl MNDO+Key+Statistics.html _tag table >MNDO+Key+Statistics.xpaths.txt
  3. Then plug the XPaths you want into this example of Web::Scraper using an XPath
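
As a rough sketch of step 3, something like the following might do. It assumes Web::Scraper is installed, and it runs against a small inline HTML stand-in rather than the saved Yahoo page, so the table id "stats", the cell contents, and the result keys "labels" and "values" are all made up for illustration. Substitute the XPaths you got from step 2.

```perl
use strict;
use warnings;
use Web::Scraper;

# Stand-in for the saved page. The real markup will differ, and the
# id "stats" and the cell values below are invented for this sketch.
my $html = <<'HTML';
<html><body>
<table id="stats">
  <tr><td>Market Cap</td><td>111</td></tr>
  <tr><td>Trailing P/E</td><td>222</td></tr>
</table>
</body></html>
HTML

# A selector that starts with "/" is treated as an XPath expression.
# The "[]" suffix on a key collects every match into an array ref.
my $stats = scraper {
    process '//table[@id="stats"]//tr/td[1]', 'labels[]' => 'TEXT';
    process '//table[@id="stats"]//tr/td[2]', 'values[]' => 'TEXT';
};

# scrape() accepts an HTML string; it also accepts a URI object.
my $res = $stats->scrape($html);

# Pair each label with the value from the same row.
my %stat;
@stat{ @{ $res->{labels} } } = @{ $res->{values} };

print "$_ = $stat{$_}\n" for sort keys %stat;
```

To run it against the file from step 1, read that file into $html instead of using the heredoc.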

Read perlintro and References quick reference for a quick refresher.

Read http://w3schools.com/xpath/default.asp for a gentle introduction to XPath.

Use "cpan Web::Scraper" or "cpanp i Web::Scraper" to install Web::Scraper.


Re^2: Help With Online Table Scraper
by Anonymous Monk on Mar 03, 2011 at 09:40 UTC
    Sinistral is right: use a documented API whenever one is available, because scraping is a fragile PITA :) Come to think of it, Web::Scraper might also be a bit of a PITA, but I've only studied the trivial examples, not the others.
