Beefy Boxes and Bandwidth Generously Provided by pair Networks
"be consistent"
 
PerlMonks  

Re: Extracting information from a PDF file

by Popcorn Dave (Abbot)
on Aug 20, 2008 at 20:31 UTC ( #705615=note: print w/ replies, xml ) Need Help??


in reply to [Updated] Extracting information from a PDF file

There is a non-Perl way to do it, depending on what you're after and how many files you have. Adobe's website offers that as a free service, or at least they used to, so if you're having problems you might check that out as well.


Revolution. Today, 3 O'Clock. Meet behind the monkey bars.

I would love to change the world, but they won't give me the source code


Comment on Re: Extracting information from a PDF file
Re^2: Extracting information from a PDF file
by Lawliet (Curate) on Aug 20, 2008 at 20:35 UTC

    Just one file. I need to upload the data I extract to a database. I'll try and see what I can find on their website.

    Update: Do you mean they can extract information or convert the file? :\

    I'm so adjective, I verb nouns!

    chomp; # nom nom nom

      Been a while since I needed it, but as I remember you give them a link to your file and then they send you back the text via e-mail. Hopefully it will do what you want.


      Revolution. Today, 3 O'Clock. Meet behind the monkey bars.

      I would love to change the world, but they won't give me the source code

        Ah, thank you. I assume this is what you are referring to?

        I'm so adjective, I verb nouns!

        chomp; # nom nom nom

Re^2: Extracting information from a PDF file
by Your Mother (Chancellor) on Aug 20, 2008 at 22:30 UTC

    IIRC, Gmail will parse PDF attachments out for display as HTML too... I think I'm remembering right. It was a few months ago that I was playing with it and Adobe's service was either super slow or down. Obviously can't speak to the parse quality.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://705615]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others perusing the Monastery: (7)
As of 2015-07-02 23:24 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (47 votes), past polls