|
|
|
Your skill will accomplish what the force of many cannot |
|
| PerlMonks |
Re: PDF Textby hesco (Deacon) |
| on Jun 13, 2008 at 02:24 UTC ( [id://691834]=note: print w/replies, xml ) | Need Help?? |
|
I've not used it, but will underscore the recommendation for swish-e, based on what I've heard about it. But to answer your specific question, I use pdftotext to extract the ascii text from a compliant pdf file. Its a bash command line tool which is distributed with the xpdf reader application in many linux distributions. It won't work on scanned images (for which that PDF::OCR sounds particularly interesting; I'll have to check that out, ++ and thanks!). But for folks who export editable documents to PDF, it works like a charm (though is challenged a bit by multi-column content). -- Hugh
if( $lal && $lol ) { $life++; }
In Section
Seekers of Perl Wisdom
|
|
||||||||||||||||||||||||||||||||||