tagging PDFs with keywords and filtering them later

by LanX (Chancellor)
on Nov 19, 2012 at 18:13 UTC
LanX has asked for the wisdom of the Perl Monks concerning the following question:


I have a large number of PDF documents in a directory, from various sources (Word, scans, LaTeX), which I would like to organize.

Ideally this would work more or less the same way I can already find PDFs with embedded text by using grep.

My idea is to add keywords/tags and to use an application to filter them by those keywords.

Any suggestions on how to do this from the command line, preferably with Perl?

Cheers Rolf

Re: tagging PDFs with keywords and filtering them later
by snoopy (Deacon) on Nov 19, 2012 at 21:38 UTC
    Image::ExifTool is a module for adding keywords and other searchable metadata to PDF files and to various image formats.

    You can use its OO interface or its command-line utility, exiftool:

        % # add keywords to a PDF
        % exiftool -keywords=perl -keywords=snoopy test.pdf
        %
        % # read them back
        % exiftool -keywords test.pdf
        Keywords : perl, snoopy
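
    For the OO route, something along these lines might work as a starting point. It is only a sketch: the sub names (add_keywords, pdf_keywords, has_all_tags) are made up for illustration, and it assumes that the List option makes Image::ExifTool hand multi-valued Keywords back as an array reference. Matching is case-insensitive and requires all given tags, which is a design choice rather than anything ExifTool imposes.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# True if every wanted tag appears in the keyword list (case-insensitive).
# Pure Perl, so it can be tried without Image::ExifTool installed.
sub has_all_tags {
    my ($keywords, $wanted) = @_;
    my %have = map { lc($_) => 1 } @$keywords;
    return !grep { !$have{ lc($_) } } @$wanted;
}

# Add tags to the Keywords list of one PDF (the file is rewritten in place).
sub add_keywords {
    my ($file, @tags) = @_;
    require Image::ExifTool;    # loaded lazily; install it from CPAN
    my $et = Image::ExifTool->new;
    # AddValue => 1 appends to the existing list instead of replacing it
    $et->SetNewValue(Keywords => $_, AddValue => 1) for @tags;
    # WriteInfo returns 0 on error, 1 or 2 otherwise
    $et->WriteInfo($file)
        or die "Could not write $file: " . ($et->GetValue('Error') // 'unknown error') . "\n";
    return;
}

# Read the Keywords of one PDF and return them as a plain list.
sub pdf_keywords {
    my ($file) = @_;
    require Image::ExifTool;
    my $et = Image::ExifTool->new;
    $et->Options(List => 1);    # assumption: list-type tags come back as array refs
    my $kw = $et->ImageInfo($file, 'Keywords')->{Keywords};
    return () unless defined $kw;
    return ref $kw eq 'ARRAY' ? @$kw : ($kw);
}

# Run as a script: print every PDF in the current directory that carries
# all of the tags given on the command line.
if (@ARGV && !caller) {
    my @wanted = @ARGV;
    for my $file (sort glob '*.pdf') {
        print "$file\n" if has_all_tags([ pdf_keywords($file) ], \@wanted);
    }
}
```

    Saved under a name of your choosing, say pdftags.pl, it would be called as "perl pdftags.pl perl snoopy" to list the PDFs tagged with both keywords. Tagging itself could be done with add_keywords('test.pdf', 'perl', 'snoopy') or with the exiftool command shown above.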
      Awesome! Thanks :)

      Cheers Rolf

      PS: CPAN rules! Really! =)

Re: tagging PDFs with keywords and filtering them later
by Anonymous Monk on Nov 19, 2012 at 18:18 UTC

