|
|
| laziness, impatience, and hubris | |
| PerlMonks |
Re: Spreadsheet::ParseExcel with embedded PDF cellsby jmcnamara (Monsignor) |
| on Jan 09, 2009 at 16:27 UTC ( #735234=note: print w/ replies, xml ) | Need Help?? |
|
As such Spreadsheet::ParseExcel isn't of any use in this case. If you want to extract the PDF files you will need to use OLE::Storage_Lite. The first thing you will need to find out is the PPS (property set) name of the embedded objects. The smplls.pl utility that is part of the OLE::Storage_Lite will show you the File structure and the PPS names. For example:
Then you can extract the PPS structures using OLE::Storage_Lite. Here is a sample program that extracts the "Summary Information" from an Excel file to get you started. Note, if the PPS name appears to start with a space it may actually be a low ordinal character such as "\0", "\1" or as in the case above "\5".
--
In Section
Seekers of Perl Wisdom
|
|
||||||||||||||||||||||||||