Beefy Boxes and Bandwidth Generously Provided by pair Networks
Keep It Simple, Stupid

Re: Reading PDF files

by elef (Friar)
on Apr 22, 2011 at 17:31 UTC ( #900845=note: print w/replies, xml ) Need Help??

in reply to Reading PDF files

I'd just use XPDF. The author of CAM::Pdf himself says that the pdf->txt converter was an afterthought, not a major part of the project, and doesn't work that well with messy files. Pdftotext (part of xpdf) works better.

Replies are listed 'Best First'.
Re: Reading PDF files
by karthikasasanka (Acolyte) on Apr 25, 2011 at 12:01 UTC
    Thanks for the response. the PDF file contains string '<</Length 6 0 R/Filter /FlateDecode>>' I think the PDF file is compressed with Deflate. So I tried below code
    use IO::Uncompress::Unzip qw(unzip $UnzipError) ; my $z = new IO::Uncompress::Unzip $in or die "unzip failed: $UnzipErro +r\n"; my $op = []; unzip $in => $op;
    And also tried using the AnyInflate and AnyUncompress. But the issue still pursues.

    can anyone tell me if you have idea about inflate and deflate.

    Thanks , Kartheek

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://900845]
and all is quiet...

How do I use this? | Other CB clients
Other Users?
Others contemplating the Monastery: (5)
As of 2018-05-25 19:04 GMT
Find Nodes?
    Voting Booth?