Beefy Boxes and Bandwidth Generously Provided by pair Networks
Just another Perl shrine

Re^4: Split a file based on column

by davido (Archbishop)
on Jan 17, 2013 at 18:56 UTC ( #1013856=note: print w/replies, xml ) Need Help??

in reply to Re^3: Split a file based on column
in thread Split a file based on column

Loading a 19GB file into memory does indeed give pause for thought.... long long pause. :) Time enough to contemplate approaches that do scale well.

Your accumulate and write when full strategy is a pretty good idea. It would be a data cache rather than a filehandle cache, and the implementation ought to be pretty straight forward. Implementing the file-handle LFU cache seems like it would be more fun though.


Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1013856]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others romping around the Monastery: (11)
As of 2016-10-26 14:11 GMT
Find Nodes?
    Voting Booth?
    How many different varieties (color, size, etc) of socks do you have in your sock drawer?

    Results (341 votes). Check out past polls.