
pdftotext pass options

by welle (Beadle)
on May 22, 2013 at 17:02 UTC ( #1034789=perlquestion )
welle has asked for the wisdom of the Perl Monks concerning the following question:

Hi monks

On a Windows machine I am using pdftotext to batch-convert PDF files into plain text. Using the following, I get no problems:

system("$path/pdftotext","-nopgbrk","$path_my_pdf","$path_my_pdf.txt");

I now want to add the encoding option "-enc UTF-8", so I try

system("$path/pdftotext","-enc UTF-8 -nopgbrk","$path_my_pdf","$path_my_pdf.txt");

or simply

system("$path/pdftotext","-enc UTF-8","$path_my_pdf","$path_my_pdf.txt");

but it doesn't work. What am I missing? Thanks


system("$path/pdftotext","-enc","UTF-8","$path_my_pdf","$path_my_pdf.txt");
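
The likely reason the failing calls do not work: with the list form of system, each list element reaches the program as exactly one argument, so "-enc UTF-8" arrives as a single unknown option rather than as an option followed by its value. A minimal sketch of building the argument list, where the sub name pdftotext_cmd and the sample paths are hypothetical and only the -enc and -nopgbrk options come from the question:

```perl
use strict;
use warnings;

# Hypothetical helper: returns the command as a list in which every
# option and every option value is its own element.
sub pdftotext_cmd {
    my ($path, $pdf) = @_;
    return (
        "$path/pdftotext",
        '-enc', 'UTF-8',    # two elements, not the single string "-enc UTF-8"
        '-nopgbrk',
        $pdf,
        "$pdf.txt",
    );
}

# Sample paths, assumed for illustration only.
my @cmd = pdftotext_cmd('C:/xpdf', 'C:/docs/sample.pdf');

# Show what the program would receive, one argument per line.
print "arg: $_\n" for @cmd[1 .. $#cmd];

# Uncomment to run the conversion and check its exit status:
# system(@cmd) == 0 or die "pdftotext failed: $?";
```

Keeping the command in an array also makes it easy to add or drop options before the call and to print the exact arguments when something goes wrong.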

Re: pdftotext pass options
by Not_a_Number (Parson) on May 22, 2013 at 18:06 UTC

    According to the pdftotext man page, -enc defaults to "UTF-8" anyway.

    So maybe you don't need to bother?

Node Type: perlquestion [id://1034789]
Approved by Happy-the-monk