in reply to Comparison word against pdf
I've done some rudimentary parsing of PDF's using CAM::PDF's getPageText() method, but I was only able to deal with PDF v1.4 formatted files though (v1.5 and v1.6 I couldn't parse).
I have not done anything similar in Word, but there must be something around that performs a similar extraction function.
Once you've extracted each file, then you'd need to write the comparator function.
What can be asserted without proof can be dismissed without proof. - Christopher Hitchens, 1949-2011
|
---|
Replies are listed 'Best First'. | |
---|---|
Re^2: Comparison word against pdf
by hdb (Monsignor) on Apr 16, 2013 at 19:04 UTC |
In Section
Seekers of Perl Wisdom