
Grab a Textbook's Chapter Summaries from web

by silent11 (Vicar)
on Jan 21, 2004 at 16:54 UTC (#322944, CUFP)

This tool grabs all of a textbook's chapter summaries from the web and combines them into one easy-to-manage HTML file. Your book must, of course, be available on McGraw-Hill's website.
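
To point the script at a different book, three hard-coded values would need to change: the chapter count, the title, and the numeric site ID in the URL (0072400722 here). Here is a minimal sketch of building the per-chapter URLs from those values. chapter_urls is a hypothetical helper that is not part of the script, and it assumes other books use the same student_view0/chapterN/chapter_overview.html layout, which may not hold for every title.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper: build one overview URL per chapter.
# Assumes every book follows the same path layout as the one in this post.
sub chapter_urls {
    my ($site_id, $chapters) = @_;
    my $base = "http://highered.mcgraw-hill.com/sites/$site_id/student_view0";
    return map { "$base/chapter$_/chapter_overview.html" } 1 .. $chapters;
}

# Print the URLs for the book used in this post.
print "$_\n" for chapter_urls('0072400722', 14);
```

Checking the printed URLs in a browser first is a cheap way to find out whether a given book follows this layout before running the full script.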

usage:
perl summary.pl > summary.html

#!/usr/bin/perl
use strict;
use warnings;

use HTML::TableExtract;
use LWP::Simple;
use CGI;

use constant CHAPTERS => 14;
use constant TITLE    => 'Communicating at Work: Principles and Practices for Business and Professions, 7/e';

my $c = CGI->new;
print $c->h1(TITLE);

for my $count ( 1 .. CHAPTERS ) {
    my $URL = 'http://highered.mcgraw-hill.com/sites/0072400722/student_view0/chapter'
            . $count . '/chapter_overview.html';

    # LWP::Simple's get() returns undef on failure and does not set $!
    my $HTML = get($URL) or die "Could not fetch $URL\n";

    # depth and count are zero-based: second table at nesting depth 2
    my $te = HTML::TableExtract->new( depth => 2, count => 1 );
    $te->parse($HTML);

    foreach my $ts ( $te->table_states ) {
        print $c->h3(" Chapter $count Overview");
        foreach my $row ( $ts->rows ) {
            # Empty cells come back undef; print them as empty strings
            print $c->p( join( ',', map { defined $_ ? $_ : '' } @$row ), "\n" );
        }
    }
}
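
The depth => 2, count => 1 arguments are the part most likely to need adjusting if the page layout differs. Here is a small offline sketch of how HTML::TableExtract selects a table with them. The HTML is made up for illustration and is not taken from the McGraw-Hill pages.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use HTML::TableExtract;

# Made-up HTML: an outer table (depth 0) holding a layout table (depth 1),
# which in turn holds two tables at depth 2.
my $html = <<'HTML';
<table><tr><td>
  <table><tr><td>
    <table><tr><td>navigation</td></tr></table>
    <table><tr><td>Summary A</td><td>Summary B</td></tr></table>
  </td></tr></table>
</td></tr></table>
HTML

# depth and count are both zero-based, so this asks for the second table
# found at the third level of nesting.
my $te = HTML::TableExtract->new( depth => 2, count => 1 );
$te->parse($html);

# Collect every row of every matching table, then print each row's cells
# comma-separated, treating undefined (empty) cells as empty strings.
my @rows = map { $_->rows } $te->table_states;
print join( ',', map { defined $_ ? $_ : '' } @$_ ), "\n" for @rows;
```

If the script prints nothing for your book, trying other depth and count values against a saved copy of one chapter page is probably the quickest way to locate the summary table.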


-silent11
