Beefy Boxes and Bandwidth Generously Provided by pair Networks
Keep It Simple, Stupid
 
PerlMonks  

Re^2: Extracting text from PDF. No really

by clinton (Priest)
on Mar 29, 2008 at 13:04 UTC ( #677222=note: print w/ replies, xml ) Need Help??


in reply to Re: Extracting text from PDF. No really
in thread Extracting text from PDF. No really

Thanks for responding, Chris. You'd be interested to know (as I mentioned in the OP), that the question "how do I extract text from a PDF" comes up a lot, and that the standard answer is always CAM::PDF.

After your response, it seems that there is no Perl module for reading/rendering PDFs, and that about the only reliable OOS way to do it is via pdftotext from either Xpdf or Poppler.

Clint


Comment on Re^2: Extracting text from PDF. No really
Download Code

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://677222]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others pondering the Monastery: (5)
As of 2014-12-26 04:35 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    Is guessing a good strategy for surviving in the IT business?





    Results (165 votes), past polls