|Perl: the Markov chain saw|
Macintosh PDF's on Windowsby HamNRye (Monk)
|on Mar 01, 2007 at 17:51 UTC||Need Help??|
HamNRye has asked for the
wisdom of the Perl Monks concerning the following question:
I have a program that crops PDF pages. Basically, we scan the file for the MediaBox area, and can enter our crop values as ASCII text. It all works great.
However, we came across a PDF we can't handle. The reason, it's from a mac. The line ends appear to be mostly "0D" as opposed to "0A". So, when I open the file and resave it, even in binmode, the file is mangled.
I'll try to explain this with as little BS as possible...
A sample of the two files:
I have tried the usual "convert mac to dos" line ends and it screws up the xref table. Anyone have any experience with MAC PDF's and manipulating them on Windows?
perl -p -e 's/(\r\n|\n|\r)/\r/g' inputfile > outputfile