Apropos pdftotext. What about CAM::PDF to extract the text? As far as i remember it comes with such a feature. Regards, Karl
Update: Just found an example on my box. I didn't remember that it is so simple:
#!/usr/bin/env perl
use strict;
use warnings;
use CAM::PDF;
use feature qw(say);
my $file = shift;
my $pdf = CAM::PDF->new($file);
say $pdf->getPageText(1);
__END__
«The Crux of the Biscuit is the Apostrophe»
perl -MCrypt::CBC -E 'say Crypt::CBC->new(-key=>'kgb',-cipher=>"Blowfish")->decrypt_hex($ENV{KARL});'Help