PerlMonks
Re^2: Extracting text from MS Word files on a Linux box
by afoken (Chancellor) on Jun 21, 2018 at 20:18 UTC ( [id://1217133]=note )
Have you tried strings? Always used to do the trick before the MS format changed.

docx is just a bunch of zipped XML files plus some miscellaneous files. strings will fail on the .docx file itself because of the ZIP compression, but once the archive is unpacked, strings will happily dig through the XML files.

Alexander
-- Today I will gladly share my knowledge and experience, for there are no sweeter words than "I told you so". ;-)
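
The unpack-then-scan idea from the post can be sketched in a few lines. This is only a rough illustration, written in Python rather than Perl, and it is not the poster's code: the function name `docx_strings`, the `min_len` parameter, and the choice to look only at members ending in `.xml` are assumptions made for the example. It reads each XML member straight out of the ZIP container and collects runs of printable ASCII, which is roughly what strings reports by default.

```python
import re
import zipfile


def docx_strings(source, min_len=4):
    """Collect printable ASCII runs from the XML members of a .docx archive.

    `source` is a path or a binary file-like object. The name and the
    `min_len` default are illustrative assumptions, not an existing API.
    """
    # Runs of at least `min_len` printable ASCII bytes, similar in spirit
    # to the default behaviour of strings(1).
    pattern = re.compile(rb"[\x20-\x7e]{%d,}" % min_len)
    found = []
    with zipfile.ZipFile(source) as archive:
        for name in archive.namelist():
            # Assumption: only the XML parts are of interest; media and
            # other binary members are skipped.
            if name.endswith(".xml"):
                data = archive.read(name)  # decompressed bytes
                found.extend(m.decode("ascii") for m in pattern.findall(data))
    return found
```

Calling `docx_strings("report.docx")` (a hypothetical file name) returns the matches as a list. As with running strings on the unpacked files, the output still contains the XML markup around the text, so further filtering is needed to get prose only.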