Beefy Boxes and Bandwidth Generously Provided by pair Networks
Don't ask to ask, just ask
 
PerlMonks  

Re^3: Extract data from website and transfer it to Outlook

by Paintermonkey (Novice)
on Apr 27, 2006 at 23:29 UTC ( [id://546143]=note: print w/replies, xml ) Need Help??


in reply to Re^2: Extract data from website and transfer it to Outlook
in thread Extract data from website and transfer it to Outlook

Thanks guys! You have all been really helpful and really nice! I like this site so much I've taken the good advice and registered myself.

I've been reading up on WWW::Mechanize. Sadly, the website in question IS in Javascript. I haven't given up yet. I'm going to spend some more time looking up as much info as I can.

I also have acquired a .pdf version of the web data. I'm doing the best I can , trying to tackle this with straight Perl or trying to convert and extract info from the .pdf .

Hopefully, I will be successful.

As to quitting...I will not be doing this forever. I will not be doing this forever. I will not be doing this forever.

Many thanks, and of course further advice is welcomed!

Telly
  • Comment on Re^3: Extract data from website and transfer it to Outlook

Replies are listed 'Best First'.
Re^4: Extract data from website and transfer it to Outlook
by hesco (Deacon) on Apr 28, 2006 at 04:45 UTC
    If you have a pdf of the data you need try pdftotext, which is a part of the xpdf library. It does a great job of extracting useful text from any pdf not based on a scan of an image. Then you'll have plain text to parse into fields. That would save the WWW::Mechanize learning curve and the javascript frustrations it raises. -- Hugh

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://546143]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others pondering the Monastery: (5)
As of 2024-04-24 00:34 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found