Beefy Boxes and Bandwidth Generously Provided by pair Networks
Clear questions and runnable code
get the best and fastest answer
 
PerlMonks  

Re: Reading PDF files

by elef (Friar)
on Apr 22, 2011 at 17:31 UTC ( #900845=note: print w/ replies, xml ) Need Help??


in reply to Reading PDF files

I'd just use XPDF. The author of CAM::Pdf himself says that the pdf->txt converter was an afterthought, not a major part of the project, and doesn't work that well with messy files. Pdftotext (part of xpdf) works better.


Comment on Re: Reading PDF files
Re: Reading PDF files
by karthikasasanka (Acolyte) on Apr 25, 2011 at 12:01 UTC
    Thanks for the response. the PDF file contains string '<</Length 6 0 R/Filter /FlateDecode>>' I think the PDF file is compressed with Deflate. So I tried below code
    use IO::Uncompress::Unzip qw(unzip $UnzipError) ; my $z = new IO::Uncompress::Unzip $in or die "unzip failed: $UnzipErro +r\n"; my $op = []; unzip $in => $op;
    And also tried using the AnyInflate and AnyUncompress. But the issue still pursues.

    can anyone tell me if you have idea about inflate and deflate.

    Thanks , Kartheek

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://900845]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others cooling their heels in the Monastery: (7)
As of 2014-11-25 01:07 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    My preferred Perl binaries come from:














    Results (148 votes), past polls