Beefy Boxes and Bandwidth Generously Provided by pair Networks
Do you know where your variables are?
 
PerlMonks  

Re: parse content of PDF file

by archfool (Monk)
on Aug 03, 2007 at 13:50 UTC ( [id://630510]=note: print w/replies, xml ) Need Help??


in reply to parse content of PDF file

If there were any reasonable way to do it, the software would cost a lot. Your key here was _scanned_. This means Optical Character Recognition (OCR), a very imperfect science at the moment. You will need OCR software, and there's very little free OCR software out there, let alone any Perl bindings to it.

You'll need to convert the PDF to text with some OCR software FIRST. THEN running perl against it will be easy.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://630510]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others about the Monastery: (5)
As of 2024-04-19 16:41 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found