http://www.perlmonks.org?node_id=890907


in reply to Help With Online Table Scraper

  1. wget -c "http://finance.yahoo.com/q/ks?s=MNDO+Key+Statistics" -O MNDO+Key+Statistics.html
  2. perl htmltreexpather.pl MNDO+Key+Statistics.html _tag table >MNDO+Key+Statistics.xpaths.txt
  3. Then plug in the xpaths you want into this example web::scraper using an xpath

Read perlintro and References quick reference for a quick refresher.

Read http://w3schools.com/xpath/default.asp for gentle introduction to xpath.

Use     cpan Web::Scraper     or     cpanp i Web::Scraper     to install Web::Scraper

Replies are listed 'Best First'.
Re^2: Help With Online Table Scraper
by Anonymous Monk on Mar 03, 2011 at 09:40 UTC
    Sinistral is right, use a documented API whenever available, scraping is a fragile PITA :) Come to think of it, Web::Scraper might be also be a bit of a PITA, but I've only studied the trivial examples, not the others