http://www.perlmonks.org?node_id=900845


in reply to Reading PDF files

I'd just use Xpdf. The author of CAM::PDF himself says that the PDF-to-text converter was an afterthought, not a major part of the project, and that it doesn't work well with messy files. pdftotext (part of Xpdf) works better.
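
As a rough illustration of the pdftotext route, here is a minimal Perl sketch that shells out to the tool and captures its output. It assumes pdftotext is installed and on the PATH; the helper names pdftotext_cmd and pdf_to_text are made up for this example. Passing "-" as the output file asks pdftotext to write to standard output, and -layout asks it to preserve the physical page layout.

```perl
use strict;
use warnings;

# Build the argument list for pdftotext (hypothetical helper).
# "-" as the output file name sends the text to stdout.
sub pdftotext_cmd {
    my ($pdf_path, %opt) = @_;
    my @cmd = ('pdftotext');
    push @cmd, '-layout' if $opt{layout};   # keep the physical page layout
    return (@cmd, $pdf_path, '-');
}

# Run pdftotext and return the extracted text as one string
# (hypothetical helper; assumes pdftotext is on the PATH).
sub pdf_to_text {
    my ($pdf_path, %opt) = @_;
    my @cmd = pdftotext_cmd($pdf_path, %opt);
    # List-form pipe open avoids passing the file name through a shell.
    open my $fh, '-|', @cmd
        or die "cannot run pdftotext: $!\n";
    local $/;                               # slurp mode
    my $text = <$fh>;
    close $fh or die "pdftotext exited with status $?\n";
    return $text;
}
```

Calling pdf_to_text('report.pdf', layout => 1) would then return the text of a (hypothetical) report.pdf, or die if the tool fails.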

Re: Reading PDF files
by karthikasasanka (Acolyte) on Apr 25, 2011 at 12:01 UTC
    Thanks for the response. The PDF file contains the string '<</Length 6 0 R/Filter /FlateDecode>>', so I think its content is compressed with Deflate. I tried the code below:
    use IO::Uncompress::Unzip qw(unzip $UnzipError);
    my $z = new IO::Uncompress::Unzip $in or die "unzip failed: $UnzipError\n";
    my $op = [];
    unzip $in => $op;
    I also tried IO::Uncompress::AnyInflate and IO::Uncompress::AnyUncompress, but the issue persists.

    Can anyone who has an idea about inflate and deflate help?

    Thanks, Kartheek
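
A note on the FlateDecode question: FlateDecode data in a PDF is zlib-wrapped deflate (RFC 1950), not a zip archive, so IO::Uncompress::Unzip is not the matching module. It also applies only to the bytes between the "stream" and "endstream" keywords, not to the PDF file as a whole. One possible cause of the failure (an assumption; the thread does not confirm it) is that the whole file was passed to the decompressor. The sketch below pulls out the first stream and inflates it with IO::Uncompress::Inflate. The helper name inflate_first_stream is made up for this example, and locating the end of the data by searching for "endstream" is a shortcut: a robust reader would resolve the /Length entry (here the indirect object 6 0 R) instead.

```perl
use strict;
use warnings;
use IO::Uncompress::Inflate qw(inflate $InflateError);

# Inflate the first stream found in raw PDF bytes (hypothetical helper).
# Assumes that stream uses /FlateDecode with no predictor or other filters.
sub inflate_first_stream {
    my ($pdf) = @_;
    # Stream data starts after "stream" plus an end-of-line marker and runs
    # up to "endstream". The captured bytes may include a trailing EOL;
    # anything after the end of the zlib stream is left over as trailing data.
    my ($raw) = $pdf =~ /stream\r?\n(.*?)endstream/s
        or die "no stream found\n";
    my $out;
    inflate(\$raw => \$out)
        or die "inflate failed: $InflateError\n";
    return $out;
}
```

To try it on a file, read the PDF in binary mode (open with '<:raw', slurp into a scalar) and pass that scalar to inflate_first_stream. For a page content stream, the result is PDF drawing operators rather than plain text, which is part of why a tool such as pdftotext is the easier route for text extraction.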