Re: CAM::PDF extract text and their coordinates from pdf..

by LanX (Canon)
on Jan 10, 2013 at 06:34 UTC (#1012597)


in reply to CAM::PDF extract text and their coordinates from pdf..

I normally use pdftohtml -xml to get exact, XML-formatted information about text, font, and position.
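A minimal sketch of how that XML could be read, written in Python purely for illustration. The sample document is hand-written to resemble the tool's output rather than captured from a real run, and the function names, the `-i` (skip images) and `-stdout` flags, and the dictionary layout are my own assumptions, not something from the original thread.

```python
import subprocess
import xml.etree.ElementTree as ET

# Hand-written sample that imitates `pdftohtml -xml` output (an assumption,
# not real tool output): <text> elements carry top/left/width/height and a
# font id that refers to a <fontspec> element.
SAMPLE_XML = """<pdf2xml>
<page number="1" position="absolute" top="0" left="0" height="1188" width="918">
<fontspec id="0" size="12" family="Times" color="#000000"/>
<text top="100" left="50" width="80" height="15" font="0">Hello</text>
<text top="120" left="50" width="95" height="15" font="0"><b>World</b> again</text>
</page>
</pdf2xml>
"""


def run_pdftohtml(pdf_path):
    """Hypothetical helper: run pdftohtml and return its XML as bytes.

    Assumes the Poppler `pdftohtml` binary is on PATH.
    """
    result = subprocess.run(
        ["pdftohtml", "-xml", "-i", "-stdout", pdf_path],
        check=True,
        capture_output=True,
    )
    return result.stdout


def parse_text_boxes(xml_source):
    """Return one dict per <text> element with its position, font and text."""
    root = ET.fromstring(xml_source)
    # Collect font specs for the whole document first, since a font declared
    # on one page may be referenced from a later page.
    fonts = {f.get("id"): f.attrib for f in root.iter("fontspec")}
    boxes = []
    for page in root.iter("page"):
        for node in page.iter("text"):
            font = fonts.get(node.get("font"), {})
            boxes.append({
                "page": int(page.get("number")),
                "top": int(node.get("top")),
                "left": int(node.get("left")),
                "width": int(node.get("width")),
                "height": int(node.get("height")),
                "family": font.get("family"),
                "size": font.get("size"),
                # itertext() flattens inline markup such as <b> or <i>.
                "text": "".join(node.itertext()),
            })
    return boxes


if __name__ == "__main__":
    for box in parse_text_boxes(SAMPLE_XML):
        print(box["page"], box["left"], box["top"], box["family"], box["text"])
```

For a real file, the bytes returned by `run_pdftohtml("some.pdf")` could be passed to `parse_text_boxes` in place of the sample string.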

For older discussions, see these search results:

2011-02-09 LanX Re: Need Help for Convert PDF to HTML Re:SoPW
2010-12-22 LanX Re^2: PDF File Merging Data Re:SoPW
2010-12-22 LanX Re: Convert PDF file into HTML file Re:SoPW
2010-03-28 LanX Re: How to invoke pdftotext and extract first line of text from PDF file? Re:SoPW
2010-03-26 LanX Parsing PDFs by text position? SoPW
2009-09-12 LanX Re: Convert PDF to HTML (or JPEG) (How?) Re:SoPW

Cheers Rolf

