PerlMonks  

Re: Spreadsheet::ParseExcel with embedded PDF cells

by Corion (Pope)
on Jan 09, 2009 at 13:38 UTC (#735173)


in reply to Spreadsheet::ParseExcel with embedded PDF cells

As far as I understand OLE embedding and OLE Document Streams (and that never was really far), an embedded object consists mostly of a preview bitmap, that is, a screenshot of its rendered representation, plus a serialized opaque blob holding the embedded object itself. The target machine hopefully has the same OLE-enabled application installed (Acrobat Reader in your case), and that application is basically called upon to refresh the "preview" bitmap or to edit the embedded object if necessary.

At least from this rudimentary understanding, I doubt that there will be a generic way to get at the "embedded file", as the data blob is mostly opaque. Possibly you can write a special-case hack for getting at the PDF, as a PDF has a distinctive start signature (%PDF-), normally ends with an %%EOF marker, and contains mostly text and zlib-compressed (Flate) blobs.
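A minimal sketch of such a special-case hack, using only core Perl: it scans a blob of bytes for a `%PDF-` header and cuts through the last `%%EOF` marker that follows it. The sub name `carve_pdfs` is made up for illustration and is not part of Spreadsheet::ParseExcel or any other module. It assumes the PDF sits in the blob uncompressed and in one contiguous piece. That may well not hold for a raw .xls file, since OLE compound files store their streams in sectors that need not be adjacent, so you would probably have to extract the embedded object's stream first and feed that to the sub.

```perl
use strict;
use warnings;

# Hypothetical helper (not from any module): carve PDF-looking byte
# ranges out of an opaque blob. Assumes each PDF is stored uncompressed
# and contiguously, which is only a guess about the embedding format.
sub carve_pdfs {
    my ($blob) = @_;
    my @found;
    my $pos = 0;

    while ( ( my $start = index( $blob, '%PDF-', $pos ) ) >= 0 ) {
        # Only look as far as the next PDF header, if there is one.
        my $next  = index( $blob, '%PDF-', $start + 5 );
        my $limit = $next >= 0 ? $next : length $blob;

        # A PDF with incremental updates has several %%EOF markers,
        # so take the last one before the limit.
        my $eof = rindex( $blob, '%%EOF', $limit - 1 );

        if ( $eof < $start ) {
            # Header without a trailer: skip it and keep scanning.
            $pos = $start + 5;
            next;
        }

        push @found, substr( $blob, $start, $eof + 5 - $start );
        $pos = $eof + 5;
    }

    return @found;
}

# Usage: perl carve.pl stream.bin   (writes carved_1.pdf, carved_2.pdf, ...)
if (@ARGV) {
    open my $in, '<:raw', $ARGV[0] or die "Cannot read $ARGV[0]: $!";
    my $blob = do { local $/; <$in> };
    close $in;

    my $n = 0;
    for my $pdf ( carve_pdfs($blob) ) {
        my $name = sprintf 'carved_%d.pdf', ++$n;
        open my $out, '>:raw', $name or die "Cannot write $name: $!";
        print {$out} $pdf;
        close $out;
    }
    print "Carved $n candidate PDF(s)\n";
}
```

Whatever comes out is only a candidate: open it in a PDF viewer before trusting it, because a `%PDF-` byte sequence can also turn up by accident inside other binary data.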

