Re: Comparison word against pdf

by rpnoble419 (Pilgrim)
on Apr 16, 2013 at 19:18 UTC (#1028993)

in reply to Comparison word against pdf

Because of how text is stored in a PDF file, this will be a next-to-impossible task. What looks like a complete word on the page may actually be stored as several separate letters or groups of letters. Also, text in a PDF does not flow the way it does in a Word document.
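
To illustrate the fragmentation problem, here is a minimal sketch in Python, assuming you have already extracted plain text from both files by some other means. The function names and the sample strings are made up for this example. It compares the two texts while ignoring whitespace, case, and ligature glyphs, which are some of the differences split-up PDF text can introduce. It does not address reading order or layout.

```python
import re
import unicodedata


def normalize(text: str) -> str:
    """Reduce text to a whitespace-free, case-folded form for comparison."""
    # NFKC folds compatibility characters, e.g. the "fi" ligature glyph
    # (U+FB01), into plain letters.
    text = unicodedata.normalize("NFKC", text)
    # Remove soft hyphens, which some tools insert at line breaks.
    text = text.replace("\u00ad", "")
    # Drop all whitespace, since a PDF may split a word into fragments.
    text = re.sub(r"\s+", "", text)
    return text.casefold()


def same_text(word_text: str, pdf_text: str) -> bool:
    """Return True if both texts match once fragmentation is ignored."""
    return normalize(word_text) == normalize(pdf_text)


# Hypothetical sample data: the PDF side is split into odd fragments
# and uses ligature glyphs.
word_text = "The final report is finished."
pdf_text = "T he \ufb01 nal re port\nis \ufb01nish ed."

print(same_text(word_text, pdf_text))
```

Removing every space also means that "a part" and "apart" compare as equal, so this can only tell you whether the same characters appear in the same order, not whether the word boundaries match.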

You can improve your chances of success if you know exactly how the PDF files were created and by which application. If you have access to Adobe Illustrator, you can open the PDF files in it and see how each page is constructed, which may give you insight into how to read the PDF objects and extract the text.
