Beefy Boxes and Bandwidth Generously Provided by pair Networks Joe
The stupid question is the question not asked
 
PerlMonks  

Re: Removing Junk from Files

by idnopheq (Chaplain)
on Apr 12, 2001 at 15:19 UTC ( [id://72072]=note: print w/replies, xml ) Need Help??

This is an archived low-energy page for bots and other anonmyous visitors. Please sign up if you are a human and want to interact.


in reply to Removing Junk from Files

Well, for your WinWord .doc files, try RTF::Parser. Some of your other file formats may well have parser modules available.

For quick and dirty, sometimes I'll save a file in html via the app and then html2txt it for the output. Lazy? Yes! But, I did automate the process via Win32::OLE.

Anywho, I'm changing employers today (hurrah!), so I can't provide my script just now.

HTH
--
idnopheq
Apply yourself to new problems without preparation, develop confidence in your ability to to meet situations as they arrise.

UPDATE: Check out iguane's WORD TO TEXT SIMPLY for the OLE stuff I mentioned.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://72072]
help
Sections?
Information?
Find Nodes?
Leftovers?
    Notices?
    hippoepoptai's answer Re: how do I set a cookie and redirect was blessed by hippo!
    erzuuliAnonymous Monks are no longer allowed to use Super Search, due to an excessive use of this resource by robots.