PerlMonks  

Re: PDF extract

by jms53 (Monk)
on Mar 31, 2013 at 10:40 UTC (#1026346)


in reply to PDF extract

Line 9,
my $pdf = PDF::API2->new(-file => "$0.pdf");


$0 contains the script's name, so if your script is called awesomedoodles.pl, it will create a PDF called awesomedoodles.pl.pdf. That is not wrong, but it makes the script less useful: every run writes to the same file, and you would have to rename the script to get a different output name.
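One way around this, as a rough sketch only: derive the output name from the input file given on the command line instead of from $0. The helper name output_name and the "_out" suffix are made up for illustration; the result would go wherever "$0.pdf" is used now.

```perl
use strict;
use warnings;
use File::Basename qw(fileparse);

# Hypothetical helper: build the output file name from the input file
# name instead of from $0 (the script's own name).
sub output_name {
    my ($input) = @_;

    # Split into base name, directory and trailing extension (if any).
    my ($base, $dir, $suffix) = fileparse($input, qr/\.[^.]*/);

    # "_out" keeps the result from clobbering an input that is already a PDF.
    return "${dir}${base}_out.pdf";
}

# Print the derived name for each file named on the command line.
if (@ARGV) {
    print output_name($_), "\n" for @ARGV;
}
```

Called as perl script.pl reports/march.pdf, this derives reports/march_out.pdf, so the same script can be reused for any input without renaming it.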

I also can't help but notice you only open one pdf file.

J -


Re^2: PDF extract
by PerlSufi (Friar) on Mar 31, 2013 at 12:57 UTC
    Thanks J, I meant to change that. I'll keep trying to figure out how to extract the PDF text.
      Here is what I have so far. When I tried to run it, I got the error message: Can't call method "getRootDict" on an undefined value...

      use CAM::PDF;
      use PDF::API2;

      # Read from the command line, but not used below yet
      my $file_name = shift;

      my $pdfone = CAM::PDF->new('pdfone.pdf');

      for my $page (1 .. $pdfone->numPages()) {
          my $text  = $pdfone->getPageText($page);
          my @lines = split /\n/, $text;
          foreach (@lines) {
              # new() returns undef if new.pdf cannot be opened or parsed;
              # passing that undef to appendPDF() is probably where the error comes from
              my $pdf = CAM::PDF->new('new.pdf');
              $pdfone->appendPDF($pdf);
          }
      }
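The "undefined value" error is consistent with CAM::PDF->new returning undef for a file it cannot open, with that undef then being passed on. A minimal sketch of checking the constructor and then printing the extracted text, assuming that is the cause; open_pdf and text_lines are hypothetical helper names, not part of CAM::PDF:

```perl
use strict;
use warnings;

# Hypothetical helper: open a PDF with CAM::PDF and die with a clear
# message instead of passing undef along to later method calls.
sub open_pdf {
    my ($path) = @_;
    require CAM::PDF;    # CPAN module, loaded on first use
    my $doc = CAM::PDF->new($path);
    die "Cannot open '$path' as a PDF\n" unless defined $doc;
    return $doc;
}

# Hypothetical helper: split extracted page text into its non-blank lines.
# A page without text is treated as an empty string.
sub text_lines {
    my ($text) = @_;
    return grep { /\S/ } split /\n/, $text // '';
}

# Print every non-blank line of text in the PDF named on the command line.
if (@ARGV) {
    my $doc = open_pdf($ARGV[0]);
    for my $page (1 .. $doc->numPages()) {
        print "$_\n" for text_lines($doc->getPageText($page));
    }
}
```

With the check in place, a missing or unreadable file is reported by name at the point where it is opened, rather than surfacing later as a method call on undef.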
