Beefy Boxes and Bandwidth Generously Provided by pair Networks
Your skill will accomplish
what the force of many cannot
 
PerlMonks  

Re^4: CAM::PDF extract text and their coordinates from pdf..

by umesh_epub (Novice)
on Jan 10, 2013 at 13:04 UTC ( #1012655=note: print w/ replies, xml ) Need Help??


in reply to Re^3: CAM::PDF extract text and their coordinates from pdf..
in thread CAM::PDF extract text and their coordinates from pdf..

Thanks David

I will look pdfminer and pstotext

I have searched pstotext in my Ghostscript "GPL Ghostscript 8.70 (2009-07-31)" But that command is not available.

In which version of the GS "pstotext" available.

Thanks,
Umesh


Comment on Re^4: CAM::PDF extract text and their coordinates from pdf..
Replies are listed 'Best First'.
Re^5: CAM::PDF extract text and their coordinates from pdf..
by snoopy (Deacon) on Jan 10, 2013 at 23:19 UTC
    Hi Umesh,

    It uses Ghostscript, but needs to be installed as a separate package. I'm running on debian which had the `pstotext` package readily available.

    But the source seems to be getting harder to find. Slackware has an archive.

    Hmm, maybe I do need to get back to work on adding word and line consolidation to PDF::ToText.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1012655]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others scrutinizing the Monastery: (17)
As of 2015-07-29 13:43 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (263 votes), past polls