Beefy Boxes and Bandwidth Generously Provided by pair Networks
We don't bite newbies here... much
 
PerlMonks  

Re: Text from PDF

by steves (Curate)
on Oct 27, 2004 at 10:08 UTC ( #402941=note: print w/ replies, xml ) Need Help??


in reply to Text from PDF

I played around with PDF::FDF::Simple and I couldn't get it to extract text from PDF files. I thought that FDF was just a subset of PDF but there must be more to it than that. Then I looked around for free PDF-to-text tools and was surprised to find that there aren't many that are truly free. Ghostscript may be your best free option. It apparently has a tool for getting text from PDF documents. Another one I found is a Java tool named PDFBox.


Comment on Re: Text from PDF

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://402941]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others making s'mores by the fire in the courtyard of the Monastery: (4)
As of 2015-07-06 01:55 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (68 votes), past polls