Beefy Boxes and Bandwidth Generously Provided by pair Networks
P is for Practical
 
PerlMonks  

Re: Extracting text from PDF. No really

by wazoox (Prior)
on Mar 28, 2008 at 14:23 UTC ( #676993=note: print w/replies, xml ) Need Help??


in reply to Extracting text from PDF. No really

pdftotext from poppler-0.6.4/ xpdf 3.02 gives a decent result for me:
IN THE NEWCASTLE COUNTY COURT Claim No 8NE00169 between MILLER HOMES LIMITED Claimant and EDEN PROPERTIES LIMITED Defendant Proceedings in the above matter will be heard at the Newcastle upon Tyne County Court at The Law Courts, Quayside, Newcastle upon Tyne on:− Date: 4th day of April 2008 Time: 10.30am Any person having an interest in these proceedings and intending to appear should do so on the above date. The Claimant’s solicitors are Ward Hadaway of Sandgate House, 102 Quayside, Newcastle upon Tyne, NE1 3DX, tel: 0191 204 4000, ref: EF.JJ.MIL181.2751

Replies are listed 'Best First'.
Re^2: Extracting text from PDF. No really
by clinton (Priest) on Mar 28, 2008 at 14:35 UTC
    wazoox, you're a **star** - I had version 3.01 of xpdf installed - upgrading to 3.02 fixed that issue.

    many thanks!

      I can only concur - this utility is brilliant and works much better than any of the Perl modules I have come across so far. Thanks so much for bringing it up! I will investigate it in further detail from now on.

      Cheers -

      Pat

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://676993]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others perusing the Monastery: (3)
As of 2023-03-21 09:00 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    Which type of climate do you prefer to live in?






    Results (59 votes). Check out past polls.

    Notices?