Beefy Boxes and Bandwidth Generously Provided by pair Networks
Welcome to the Monastery
 
PerlMonks  

Re: Extracting information from a PDF file

by Popcorn Dave (Abbot)
on Aug 20, 2008 at 20:31 UTC ( #705615=note: print w/ replies, xml ) Need Help??


in reply to [Updated] Extracting information from a PDF file

There is a non-Perl way to do it, depending on what you're after and how many files you have. Adobe's website offers that as a free service, or at least they used to, so if you're having problems you might check that out as well.


Revolution. Today, 3 O'Clock. Meet behind the monkey bars.

I would love to change the world, but they won't give me the source code


Comment on Re: Extracting information from a PDF file
Re^2: Extracting information from a PDF file
by Lawliet (Curate) on Aug 20, 2008 at 20:35 UTC

    Just one file. I need to upload the data I extract to a database. I'll try and see what I can find on their website.

    Update: Do you mean they can extract information or convert the file? :\

    I'm so adjective, I verb nouns!

    chomp; # nom nom nom

      Been a while since I needed it, but as I remember you give them a link to your file and then they send you back the text via e-mail. Hopefully it will do what you want.


      Revolution. Today, 3 O'Clock. Meet behind the monkey bars.

      I would love to change the world, but they won't give me the source code

        Ah, thank you. I assume this is what you are referring to?

        I'm so adjective, I verb nouns!

        chomp; # nom nom nom

Re^2: Extracting information from a PDF file
by Your Mother (Canon) on Aug 20, 2008 at 22:30 UTC

    IIRC, Gmail will parse PDF attachments out for display as HTML too... I think I'm remembering right. It was a few months ago that I was playing with it and Adobe's service was either super slow or down. Obviously can't speak to the parse quality.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://705615]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others contemplating the Monastery: (9)
As of 2014-08-20 16:01 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The best computer themed movie is:











    Results (118 votes), past polls