
Re: Accessing Meta data from MS WORD

by sundialsvc4 (Abbot)
on Aug 07, 2012 at 12:58 UTC (#985973)

in reply to Accessing Meta data from MS WORD

/me nods...

IIRC, a docx file is a zip archive of XML files that follow a well-known public schema. If you cannot find a CPAN module that already does what you want, one approach is to write code that unzips the file and then attacks the XML content using XPath expressions, which saves you from writing code that mirrors the XML's internal structure. But it is extremely likely that what you are doing is “a thing already done.”
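
For what it's worth, here is a minimal sketch of the unzip-then-query idea, written in Python only to keep it short; the same two steps (open the archive, run path expressions over one XML part) carry over to Perl. It assumes the document properties sit in the `docProps/core.xml` part under the standard core-properties, Dublin Core and DC terms namespaces. The function name `read_core_properties` and the `FIELDS` table are made up for illustration, and ElementTree only understands a limited subset of XPath, so treat this as a starting point rather than a finished tool.

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespace prefixes used in docProps/core.xml (assumed: standard OOXML package).
NS = {
    "cp": "http://schemas.openxmlformats.org/package/2006/metadata/core-properties",
    "dc": "http://purl.org/dc/elements/1.1/",
    "dcterms": "http://purl.org/dc/terms/",
}

# Hypothetical mapping: result key -> path expression relative to the root element.
FIELDS = {
    "title": "dc:title",
    "creator": "dc:creator",
    "last_modified_by": "cp:lastModifiedBy",
    "created": "dcterms:created",
    "modified": "dcterms:modified",
}


def read_core_properties(path):
    """Return selected core properties of a .docx file as a dict.

    `path` may be a filename or a binary file-like object.
    Properties that are absent from the document map to None.
    """
    # Step 1: a .docx is a zip archive, so pull out the metadata part.
    with zipfile.ZipFile(path) as archive:
        xml_bytes = archive.read("docProps/core.xml")

    # Step 2: parse the XML and look up each field by its namespaced path.
    root = ET.fromstring(xml_bytes)
    return {
        name: root.findtext(xpath, namespaces=NS)
        for name, xpath in FIELDS.items()
    }
```

Calling `read_core_properties("report.docx")` (the filename is a placeholder) gives back a dict keyed by the names in `FIELDS`. A document that lacks the `docProps/core.xml` part altogether will raise `KeyError` from `archive.read`, which you may want to catch.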
