Beefy Boxes and Bandwidth Generously Provided by pair Networks
The stupid question is the question not asked
 
PerlMonks  

tagging PDFs with keywords and filtering them later

by LanX (Canon)
on Nov 19, 2012 at 18:13 UTC ( #1004599=perlquestion: print w/ replies, xml ) Need Help??
LanX has asked for the wisdom of the Perl Monks concerning the following question:

Hi

I'm having a big amount of PDF documents in a directory from various sources (word, scan, latex) which I would like to organize.

More or less the same way I can find PDFs with embedded text by using grep.

My idea is to add keywords/tags and to use an application to filter them by those keywords.

Any suggestions how to do this from command line, preferably with Perl?

Cheers Rolf

Comment on tagging PDFs with keywords and filtering them later
Download Code
Re: tagging PDFs with keywords and filtering them later
by Anonymous Monk on Nov 19, 2012 at 18:18 UTC
Re: tagging PDFs with keywords and filtering them later
by snoopy (Deacon) on Nov 19, 2012 at 21:38 UTC
    Image::ExifTool is a module for adding keywords and other searchable metadata to PDF files and various other image formats.

    You can use the OO interface, or its command line utility exiftool

    % # add keywords to a PDF % exiftool -keywords=perl -keywords=snoopy test.pdf % % # read them back % exiftool -keywords test.pdf Keywords : perl, snoopy
      Awesome! Thanks :)

      Cheers Rolf

      PS: CPAN rules! Really! =)

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: perlquestion [id://1004599]
Approved by Athanasius
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others browsing the Monastery: (12)
As of 2014-12-22 20:53 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    Is guessing a good strategy for surviving in the IT business?





    Results (130 votes), past polls