|
|
| Perl: the Markov chain saw | |
| PerlMonks |
Re: PDF Textby MidLifeXis (Monsignor) |
| on Jun 12, 2008 at 18:04 UTC ( [id://691749]=note: print w/replies, xml ) | Need Help?? |
|
Do a search on CPAN to see if you find anything useful there. PDF::CAM seems to have a couple of functions that might work. Extracting the layout from a PDF files into a text file might still be problematic. It will be problematic if the page does not contain text at all, but contains a graphic image of a page instead. You would need to use some sort of OCR solution then. --MidLifeXis
In Section
Seekers of Perl Wisdom
|
|
||||||||||||||||||||||||||||||