PerlMonks  

Re: PDF extract

by jms53 (Monk)
on Mar 31, 2013 at 10:40 UTC


in reply to PDF extract

Line 9,
my $pdf = PDF::API2->new(-file => "$0.pdf");


If your script is called awesomedoodles.pl, this creates a PDF called awesomedoodles.pl.pdf, because $0 contains the script's name. That isn't wrong, but it makes the script less useful: the output name is tied to the script name, so you would have to rename the script every time you want a different output file.
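
One way around this, as a minimal sketch only: take the output name from the command line and fall back to a default. Here output_name is a hypothetical helper and 'output.pdf' is an assumed default; neither comes from your script.

```perl
use strict;
use warnings;

# Hypothetical helper: pick the output file name from the argument list,
# falling back to an assumed default instead of "$0.pdf".
sub output_name {
    my @args = @_;
    return @args ? $args[0] : 'output.pdf';
}

my $out_file = output_name(@ARGV);
print "Output file: $out_file\n";

# The name would then be passed to the constructor from line 9:
# my $pdf = PDF::API2->new(-file => $out_file);
```

Run as "perl awesomedoodles.pl report.pdf", this would select report.pdf without touching the script's own name.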

I also can't help but notice you only open one pdf file.

J -


Re^2: PDF extract
by PerlSufi (Pilgrim) on Mar 31, 2013 at 12:57 UTC
    Thanks J, I meant to change that. I'll keep trying to figure out how to extract PDF text.
      Here is what I have so far. When I tried to run it, I got the error message: Can't call method "getRootDict" on an undefined value...
      use CAM::PDF;
      use PDF::API2;

      my $file_name = shift;
      my $pdfone = CAM::PDF->new('pdfone.pdf');

      for my $page (1 .. $pdfone->numPages()) {
          my $text = $pdfone->getPageText($page);
          @lines = split (/\n/, $text);
          foreach (@lines) {
              my $pdf = CAM::PDF->new('new.pdf');
              $pdfone->appendPDF($pdf);
          }
      }
