PerlMonks  

Re: PDF extract

by jms53 (Monk)
on Mar 31, 2013 at 10:40 UTC (#1026346)


in reply to PDF extract

Line 9,
my $pdf = PDF::API2->new(-file => "$0.pdf");


If your script is called awesomedoodles.pl, this creates a PDF named awesomedoodles.pl.pdf, because $0 contains the script's name. That isn't wrong, but it makes the script less useful: the output name is tied to the script name, so you would have to rename the script each time you wanted a different output file.
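One way around this is to take the file names from the command line instead of from $0. The following is only a minimal sketch: the helper output_name_for and the "-extract" suffix are made up for illustration, and the PDF::API2 call is left as a comment so the snippet runs with core modules only.

```perl
use strict;
use warnings;
use File::Basename qw(fileparse);

# Hypothetical helper: derive the output name from the input PDF
# rather than from $0 (the script's own name).
sub output_name_for {
    my ($input) = @_;
    # Split "dir/name.pdf" into ("name", "dir/", ".pdf").
    my ($base, $dir, $suffix) = fileparse($input, qr/\.pdf\z/i);
    return "${dir}${base}-extract.pdf";
}

# Usage: perl script.pl input.pdf [output.pdf]
if (@ARGV) {
    my ($input, $output) = @ARGV;
    $output //= output_name_for($input);
    print "Reading $input, writing $output\n";
    # The original line 9 would then become something like:
    # my $pdf = PDF::API2->new(-file => $output);
}
```

With this, the same script can be pointed at any PDF without being renamed, and an explicit second argument still overrides the derived name.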

I also can't help but notice that you only open one PDF file.

J -


Re^2: PDF extract
by PerlSufi (Friar) on Mar 31, 2013 at 12:57 UTC
    Thanks J, I meant to change that. I'll keep trying to figure out how to extract the PDF text.
      Here is what I have so far. When I ran it, I got the error message "Can't call method "getRootDict" on an undefined value...".

      use CAM::PDF;
      use PDF::API2;

      my $file_name = shift;
      my $pdfone = CAM::PDF->new('pdfone.pdf');

      for my $page (1 .. $pdfone->numPages()) {
          my $text = $pdfone->getPageText($page);
          @lines = split (/\n/, $text);
          foreach (@lines) {
              my $pdf = CAM::PDF->new('new.pdf');
              $pdfone->appendPDF($pdf);
          }
      }
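      A possible reading of that error, offered as a guess rather than a confirmed diagnosis: CAM::PDF->new returns undef when it cannot open or parse the file, so if new.pdf does not exist yet, $pdf is undefined when it is handed to appendPDF. The sketch below only shows how to check the constructor's return value; the helper name open_pdf and the two-argument usage are assumptions for illustration.

```perl
use strict;
use warnings;
use CAM::PDF;

# Hypothetical helper: open a PDF or die with CAM::PDF's own error text.
sub open_pdf {
    my ($path) = @_;
    my $doc = CAM::PDF->new($path)
        or die "Cannot open '$path': $CAM::PDF::errstr\n";
    return $doc;
}

# Usage: perl script.pl first.pdf second.pdf
if (@ARGV == 2) {
    my $first  = open_pdf($ARGV[0]);
    my $second = open_pdf($ARGV[1]);
    # Both objects are known to be defined at this point.
    $first->appendPDF($second);
    print "Combined document has ", $first->numPages(), " pages\n";
}
```

      Failing early like this at least names the file that could not be opened, instead of surfacing later as a method call on an undefined value.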
