extract text from pdfby jeteve (Pilgrim)
|on Nov 08, 2006 at 12:30 UTC||Need Help??|
jeteve has asked for the
wisdom of the Perl Monks concerning the following question:
Hi wise monks.
I wonder what is the simpliest solution to extract text from pdf in perl.
Of course I can use pdftotext in command line, but it involves managing temporary files ..
So I'm looking for a pure perl solution (or linked to a C library)..
I had a look at PDF::API2 , but it's more dedicated to creation.
CAM::PDF seammt to fill my need, but I can't manage to use it to extract the text ..
I also had a look at SWISH, but it internally uses ... pdftotext :) ..
Any Idea ?
-- Nice photos of naked perl sources here !