Beefy Boxes and Bandwidth Generously Provided by pair Networks Joe
Don't ask to ask, just ask
 
PerlMonks  

Re: Perl variant of linux tool strings

by duct_tape (Hermit)
on Mar 23, 2005 at 15:46 UTC ( [id://441878]=note: print w/replies, xml ) Need Help??

This is an archived low-energy page for bots and other anonmyous visitors. Please sign up if you are a human and want to interact.


in reply to Perl variant of linux tool strings

Not a module, but there are some versions of the 'strings' tool done for the Perl Power Tools project.

http://ppt.perl.org/commands/strings/index.html

Replies are listed 'Best First'.
Re^2: Perl variant of linux tool strings
by jeanluca (Deacon) on Mar 23, 2005 at 16:04 UTC
    I like to collect words from a pdf or word document! So far Perl Power tools does a very good job! Thanks

      For collecting words from pdf documents, you can use the ps2ascii utility which comes with ghostscript. It executes the document with ghostscript, using a special device that outputs only ascii text. As ghostscript can handle pdfs too, ps2ascii works fine on them (although I did have some compatibility problems with some pdfs, depending on the generating program and the version of ghostscript).

      This doesn't work for word documents of course.

        OP, you may have some luck loading MS Word into (star|open)office, printing to pdf then chucking it at ps2ascii. As it is the exact same formating that is hardest for *office to get correct and ascii has little remmenant of these I guess you could have a lot of luck.

        update

        As ambrus points out below of course if you can read the word doc into *office then you can just export ASCII from there. Sorry, it has been a rather long day

        You may also want to trawl through a list of filters, I found this one which looks like it may have some tools that could help

        Cheers,
        R.

        Pereant, qui ante nos nostra dixerunt!

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://441878]
help
Sections?
Information?
Find Nodes?
Leftovers?
    Notices?
    hippoepoptai's answer Re: how do I set a cookie and redirect was blessed by hippo!
    erzuuliAnonymous Monks are no longer allowed to use Super Search, due to an excessive use of this resource by robots.