go ahead... be a heretic | |
PerlMonks |
comment on |
( [id://3333]=superdoc: print w/replies, xml ) | Need Help?? |
I think one of the reasons Spreadsheet::XLSX is so slow, is that it doesn't use a proper XML parser, but parses the workbook(s) using regular expressions. And over that, it uses:
to be Spreadsheet::ParseExcel compatible (which it really is not. In most Spreadsheet modules, the whole spreadsheet (file) is read into memory, as there are several formats to be parsed before one can get to the actual data (ZIP, binary, ...). If the spreadsheet would be readable directly from file (like CSV, if you want to call that a spreadsheet), parsing could be a lot faster. If someone would (re)write this module using a proper (fast) XML parser, preferably with the option to select whatever (working) XML parser is installed, that would really help this module. I really mean option here, as making the module require XML::libXML would mean its death, as XML::libXML depends on libxml2, which might prove very hard to port on some non-standardish systems. So the module should choose between XML::libXML, XML::Parser, XML::Parser::Lite, XML::Simple, or XML::Twig (and even those might he depending on each other). Enjoy, Have FUN! H.Merijn In reply to Re: Speeding up Spreadsheet::XLSX file load in UNIX
by Tux
|
|