PerlMonks  

Re^2: Parse a tar.gz file without unzipping and uncompressing unzipping?

by rizzy (Sexton)
on Dec 03, 2010 at 21:18 UTC (#875275)


in reply to Re: Parse a tar.gz file without unzipping and uncompressing unzipping?
in thread Parse a tar.gz file without unzipping and uncompressing unzipping?

Thanks. That's what I'm currently doing. It's manageable right now, but I figured I'd check to see if it could be further improved.


Re^3: Parse a tar.gz file without unzipping and uncompressing unzipping?
by Illuminatus (Curate) on Dec 03, 2010 at 21:32 UTC
    From your post, it seemed like you were unpacking into a filesystem. Do the majority of the files you parse change each time you run the program? If not, you could use something like KinoSearch to quickly search the files that have not changed.

    fnord

      Illuminatus, I WAS initially using Archive::Extract, but had to switch for a couple of reasons. I am on Windows (because I can't convince the Unix admin to allow me to store these files on the server), and since there is no bin/tar for Windows, I have to use Perl to do the unzipping, which requires system memory. So much memory is required that I get an "out of memory" error when extracting.

      Anyway, the files are all different in each archive.

        I've been using Cygwin for little things like this for years. Tar, gzip, zip, bzip2, etc., all work as expected from a normal Shell window (aka DOS window). If you have gzip or other compression installed, tar will support it natively, saving you some piping.
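As a rough sketch of what that looks like in practice, the commands below list an archive and stream a single member to standard output without unpacking anything to disk. This assumes a tar with gzip support and the -O (to stdout) option, as GNU tar has; the sample archive and the file names data/a.txt and data/b.txt are made up for illustration.

```shell
# Build a small sample archive so the sketch is self-contained.
# (workdir, data/a.txt and data/b.txt are hypothetical names.)
workdir=$(mktemp -d)
mkdir "$workdir/data"
printf 'alpha\nbeta\n' > "$workdir/data/a.txt"
printf 'gamma\n' > "$workdir/data/b.txt"
tar -czf "$workdir/sample.tar.gz" -C "$workdir" data

# List the members; nothing is written to disk (-z lets tar handle gzip).
listing=$(tar -tzf "$workdir/sample.tar.gz")

# Stream one member to stdout (-O) and count its lines in a pipeline.
lines=$(tar -xzOf "$workdir/sample.tar.gz" data/a.txt | wc -l)

# The same idea with an explicit pipe, for a tar built without gzip support.
piped=$(gzip -dc "$workdir/sample.tar.gz" | tar -xOf - data/b.txt)
```

Because each member is streamed through a pipe, only the data currently being read needs to be held in memory, which may help with the "out of memory" problem described above.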

        If you want to avoid Cygwin and/or get as much performance as possible, you can also get non-Cygwin, native ports of common Linux/Unix utilities here.

        --marmot
