Beefy Boxes and Bandwidth Generously Provided by pair Networks
There's more than one way to do things
 
PerlMonks  

Re: Comparison word against pdf

by thezip (Vicar)
on Apr 16, 2013 at 18:59 UTC ( #1028988=note: print w/ replies, xml ) Need Help??


in reply to Comparison word against pdf

I've done some rudimentary parsing of PDF's using CAM::PDF's getPageText() method, but I was only able to deal with PDF v1.4 formatted files though (v1.5 and v1.6 I couldn't parse).

I have not done anything similar in Word, but there must be something around that performs a similar extraction function.

Once you've extracted each file, then you'd need to write the comparator function.


What can be asserted without proof can be dismissed without proof. - Christopher Hitchens, 1949-2011


Comment on Re: Comparison word against pdf
Re^2: Comparison word against pdf
by hdb (Parson) on Apr 16, 2013 at 19:04 UTC

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1028988]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others examining the Monastery: (4)
As of 2014-07-10 03:12 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    When choosing user names for websites, I prefer to use:








    Results (198 votes), past polls