PerlMonks  

Re: Help With Online Table Scraper

by Anonymous Monk
on Mar 02, 2011 at 05:19 UTC (#890907)


in reply to Help With Online Table Scraper

  1. wget -c "http://finance.yahoo.com/q/ks?s=MNDO+Key+Statistics" -O MNDO+Key+Statistics.html
  2. perl htmltreexpather.pl MNDO+Key+Statistics.html _tag table >MNDO+Key+Statistics.xpaths.txt
  3. Then plug the XPaths you want into a Web::Scraper example that selects by XPath
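
For step 3, something along these lines might serve as a starting point. It is only a minimal sketch, not the linked example: the script name scrape.pl, the helper extract_cells, and the XPath //table//td are placeholders I made up, so substitute whatever XPath you pick out of MNDO+Key+Statistics.xpaths.txt. It assumes Web::Scraper is installed and that the page was saved as in step 1.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use Web::Scraper;

# Hypothetical helper: return the text of every node matching $xpath.
# Web::Scraper treats a selector that starts with "/" as an XPath
# expression rather than a CSS selector.
sub extract_cells {
    my ($html, $xpath) = @_;
    my $scraper = scraper {
        process $xpath, 'cells[]' => 'TEXT';
    };
    my $result = $scraper->scrape($html);
    return $result->{cells} || [];
}

# Usage (file name from step 1, XPath is a placeholder):
#   perl scrape.pl MNDO+Key+Statistics.html '//table//td'
my ($file, $xpath) = @ARGV;
if (defined $file) {
    $xpath = '//table//td' unless defined $xpath;   # placeholder default

    # Slurp the saved page into one string.
    open my $fh, '<', $file or die "Cannot open $file: $!";
    my $html = do { local $/; <$fh> };
    close $fh;

    print "$_\n" for @{ extract_cells($html, $xpath) };
}
```

Working from the saved file rather than the live URL means you can rerun it while tuning the XPath without hitting the site each time.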

Read perlintro and References quick reference for a quick refresher.

Read http://w3schools.com/xpath/default.asp for a gentle introduction to XPath.

Use     cpan Web::Scraper     or     cpanp i Web::Scraper     to install Web::Scraper


Re^2: Help With Online Table Scraper
by Anonymous Monk on Mar 03, 2011 at 09:40 UTC
    Sinistral is right: use a documented API whenever one is available; scraping is a fragile PITA :) Come to think of it, Web::Scraper might also be a bit of a PITA, but I've only studied the trivial examples, not the others.
