PerlMonks
Re: Removing Junk from Files
by idnopheq (Chaplain) on Apr 12, 2001 at 15:19 UTC ( [id://72072] )
Well, for your WinWord .doc files, try RTF::Parser (it reads RTF, so save the documents as RTF first). Some of your other file formats may well have parser modules available.
For quick and dirty, sometimes I'll save a file as HTML via the app and then html2txt it for the output. Lazy? Yes! But I did automate the process via Win32::OLE. Anywho, I'm changing employers today (hurrah!), so I can't provide my script just now.
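To give a rough idea of the "save as HTML, then strip it to text" half of that trick, here is a minimal sketch. It is not my script and not the html2txt tool itself; it is written in Python with only the standard library so it runs anywhere, and the names `TextExtractor` and `html_to_text` are made up for illustration. It assumes the app has already saved the document as HTML and only handles the tag-stripping step.

```python
from html.parser import HTMLParser
import re


class TextExtractor(HTMLParser):
    """Collect visible text, skipping <script> and <style> content."""

    SKIP = {"script", "style"}
    # Tags that should start/end a line in the plain-text output.
    BLOCK = {"p", "div", "br", "li", "tr", "table",
             "h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in self.SKIP and self.skip_depth:
            self.skip_depth -= 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_data(self, data):
        # Keep text only when we are not inside script/style.
        if not self.skip_depth:
            self.parts.append(data)


def html_to_text(html):
    """Return the visible text of an HTML string, one block per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    text = re.sub(r"[ \t]+", " ", "".join(parser.parts))
    lines = [line.strip() for line in text.splitlines()]
    return "\n".join(line for line in lines if line)
```

Feed it the contents of the saved HTML file and write the result wherever you like. Word's HTML export tends to be noisy, so expect to add more tags to `SKIP` or `BLOCK` for real files.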
HTH
UPDATE: Check out iguane's WORD TO TEXT SIMPLY for the OLE stuff I mentioned.