Beefy Boxes and Bandwidth Generously Provided by pair Networks Frank
P is for Practical
 
PerlMonks  

problem HTML::FormatText::WithLinks::AndTables

by kevind0718 (Scribe)
on Dec 04, 2012 at 03:55 UTC ( #1006988=perlquestion: print w/ replies, xml ) Need Help??
kevind0718 has asked for the wisdom of the Perl Monks concerning the following question:

Hello Kind Monks:

I hope you will have time for my dilemma. I need to parse HTML tables.
Like the data found at this link: http://www.databasefootball.com/boxscores/scheduleyear.htm?yr=1985&lg=nfl
So I installed the module HTML::FormatText::WithLinks::AndTables;
Which came really close to doing what I need.
After downloading the source for the above page, I ran it through a bit of code and got the following:
1985 NFL Season Scores, Schedules and Playoffs [1]1986 1985 [2]1984 Week 1 Sunday, September 8 [1]IND 3 at [1]PIT + 45 [1]BOX + + + + + + + 1. /teams/teamyear.htm?tm=IND&lg=NFL&yr=1985 1. /teams/tea +myear.htm?tm=PIT&lg=NFL&yr=1985 1. /boxscores/gamedata.ht +m?dy=8&mth=9&yr=1985&tm=PIT&lg=NFL [1]SDG 14 at [1]BUF + 9 [1]BOX + + + + + + + 1. /teams/teamyear.htm?tm=SDG&lg=NFL&yr=1985 1. /teams/tea +myear.htm?tm=BUF&lg=NFL&yr=1985 1. /boxscores/gamedata.ht +m?dy=8&mth=9&yr=1985&tm=BUF&lg=NFL [1]DEN 16 at [1]LAM + 20 [1]BOX + + + + + + + 1. /teams/teamyear.htm?tm=DEN&lg=NFL&yr=1985 1. /teams/tea +myear.htm?tm=LAM&lg=NFL&yr=1985 1. /boxscores/gamedata.ht +m?dy=8&mth=9&yr=1985&tm=LAM&lg=NFL [1]PHI 0 at [1]NYG + 21 [1]BOX + + + + + + + 1. /teams/teamyear.htm?tm=PHI&lg=NFL&yr=1985 1. /teams/tea +myear.htm?tm=NYG&lg=NFL&yr=1985 1. /boxscores/gamedata.ht +m?dy=8&mth=9&yr=1985&tm=NYG&lg=NFL [1]SLC 27 at [1]CLE + 24 OT [1]BOX + + +

This close to what I need. Problem the URLs are striped out into "footnotes", but they are all [1]. Makes it a bit of a bummer to line up the table contents with the corresponding URL.
Can be done, but I was expecting the footnotes to come through as [1], [2], [3].
Hoping somebody has more experience with this module than I do.

thanks in advance

KD

Comment on problem HTML::FormatText::WithLinks::AndTables
Download Code

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: perlquestion [id://1006988]
Approved by Athanasius
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others examining the Monastery: (11)
As of 2014-04-17 08:04 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    April first is:







    Results (441 votes), past polls