PerlMonks  

Re: Help With Online Table Scraper

by Anonymous Monk
on Mar 02, 2011 at 05:19 UTC (#890907)


in reply to Help With Online Table Scraper

  1. wget -c "http://finance.yahoo.com/q/ks?s=MNDO+Key+Statistics" -O MNDO+Key+Statistics.html
  2. perl htmltreexpather.pl MNDO+Key+Statistics.html _tag table >MNDO+Key+Statistics.xpaths.txt
  3. Then plug the XPaths you want into this example of Web::Scraper using an XPath
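
Step 3 might look roughly like the sketch below. It is only an illustration: the inline HTML is a made-up stand-in for the saved page, the table id "stats", the labels, and the values are all invented, and the two XPaths are hypothetical placeholders for whatever step 2 reports for the real page.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use Web::Scraper;

# Made-up stand-in for the page saved in step 1; the real markup will differ.
my $html = <<'HTML';
<html><body>
<table id="stats">
  <tr><td>Market Cap:</td><td>123.4M</td></tr>
  <tr><td>Trailing P/E:</td><td>5.6</td></tr>
</table>
</body></html>
HTML

# Hypothetical XPaths: replace them with the ones found in step 2.
# A trailing [] on the key collects every matching node into an array ref.
my $stats = scraper {
    process '//table[@id="stats"]//tr/td[1]', 'labels[]' => 'TEXT';
    process '//table[@id="stats"]//tr/td[2]', 'values[]' => 'TEXT';
};

# scrape() accepts an HTML string (it also accepts a URI object).
my $result = $stats->scrape($html);

# Pair each label with the value from the same row.
my %stat;
@stat{ @{ $result->{labels} } } = @{ $result->{values} };

print "$_ $stat{$_}\n" for sort keys %stat;
```

To run this against the file saved in step 1, read that file into a string and pass it to scrape() in place of $html. Pairing labels and values by position assumes every row has both cells, which is worth checking against the real page.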

For a quick refresher, read perlintro and the References quick reference.

Read http://w3schools.com/xpath/default.asp for a gentle introduction to XPath.

To install Web::Scraper, run     cpan Web::Scraper     or     cpanp i Web::Scraper

Re^2: Help With Online Table Scraper
by Anonymous Monk on Mar 03, 2011 at 09:40 UTC
    Sinistral is right: use a documented API whenever one is available, because scraping is a fragile PITA :) Come to think of it, Web::Scraper might also be a bit of a PITA, but I've only studied the trivial examples, not the others.
