Beefy Boxes and Bandwidth Generously Provided by pair Networks
P is for Practical
 
PerlMonks  

Re: Reading PDF files

by elef (Friar)
on Apr 22, 2011 at 17:31 UTC ( [id://900845]=note: print w/replies, xml ) Need Help??


in reply to Reading PDF files

I'd just use XPDF. The author of CAM::Pdf himself says that the pdf->txt converter was an afterthought, not a major part of the project, and doesn't work that well with messy files. Pdftotext (part of xpdf) works better.

Replies are listed 'Best First'.
Re: Reading PDF files
by karthikasasanka (Acolyte) on Apr 25, 2011 at 12:01 UTC
    Thanks for the response. the PDF file contains string '<</Length 6 0 R/Filter /FlateDecode>>' I think the PDF file is compressed with Deflate. So I tried below code
    use IO::Uncompress::Unzip qw(unzip $UnzipError) ; my $z = new IO::Uncompress::Unzip $in or die "unzip failed: $UnzipErro +r\n"; my $op = []; unzip $in => $op;
    And also tried using the AnyInflate and AnyUncompress. But the issue still pursues.

    can anyone tell me if you have idea about inflate and deflate.

    Thanks , Kartheek

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://900845]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others about the Monastery: (4)
As of 2024-03-19 06:45 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found