Thanks guys! You have all been really helpful and really nice! I like this site so much I've taken the good advice and registered myself.
I've been reading up on WWW::Mechanize. Sadly, the website in question DOES rely on JavaScript. I haven't given up yet. I'm going to spend some more time looking up as much info as I can.
I have also acquired a .pdf version of the web data. I'm doing the best I can, trying to tackle this either with straight Perl or by converting the .pdf and extracting the info from it.
Hopefully, I will be successful.
As to quitting...I will not be doing this forever. I will not be doing this forever. I will not be doing this forever.
Many thanks, and of course further advice is welcomed!
Telly
If you have a PDF of the data you need, try pdftotext, which is part of the xpdf package. It does a great job of extracting useful text from any PDF that isn't just a scanned image. Then you'll have plain text to parse into fields. That would save you the WWW::Mechanize learning curve and the JavaScript frustrations that come with it. -- Hugh
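For what it's worth, here is a rough sketch of how that could look from Perl. It assumes pdftotext is installed and on your PATH, and it guesses that the columns in your data are separated by runs of two or more spaces once -layout has been applied. The sub names (pdf_to_lines, split_fields) and the '|' output separator are made up for illustration, so adjust them to whatever your data actually looks like.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Run pdftotext and return the extracted text as a list of lines.
# -layout tries to preserve the column layout; "-" sends the text to stdout.
sub pdf_to_lines {
    my ($pdf) = @_;
    open(my $fh, '-|', 'pdftotext', '-layout', $pdf, '-')
        or die "Cannot run pdftotext: $!";
    my @lines = <$fh>;
    close($fh) or die "pdftotext failed on $pdf (exit status $?)";
    chomp @lines;
    return @lines;
}

# Split one line into fields wherever two or more whitespace characters occur.
# ASSUMPTION: fields are separated by runs of spaces; single spaces stay
# inside a field (e.g. "Smith, J.").
sub split_fields {
    my ($line) = @_;
    $line =~ s/^\s+|\s+$//g;    # trim leading and trailing whitespace
    return split /\s{2,}/, $line;
}

# Only touch a PDF when a file name is given on the command line.
if (@ARGV) {
    for my $line (pdf_to_lines($ARGV[0])) {
        next unless $line =~ /\S/;    # skip blank lines
        print join('|', split_fields($line)), "\n";
    }
}
```

Run it as `perl extract.pl yourfile.pdf` (the script name is just an example) and eyeball the output first. If the fields don't line up, try dropping -layout or changing the split pattern before writing anything more elaborate.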