http://www.perlmonks.org?node_id=890800


in reply to Re: randomising file order returned by File::Find
in thread randomising file order returned by File::Find

"... [script] builds a big list in memory and then partitions the matching files into 100+ lists (1 per cluster instance) and writes them to separate files."

This is pretty much what Hadoop does for you: it splits the input and hands the pieces out to the nodes in the cluster.
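
For comparison, here is a rough sketch of the do-it-yourself approach described in the quote: walk the tree with File::Find, shuffle the result, deal the paths round-robin into N lists, and write each list to its own file. The names partition_files and write_lists, the 'filelist' prefix, and the numbered-suffix naming are my own inventions for illustration, not anything from the original script, and round-robin dealing is just one reasonable way to split the list.

```perl
use strict;
use warnings;
use File::Find;
use List::Util qw(shuffle);

# Hypothetical helper: collect every plain file under $root, shuffle the
# list, and deal the paths round-robin into $n buckets.
sub partition_files {
    my ($root, $n) = @_;
    my @files;

    # no_chdir => 1 keeps $_ set to the full path, so -f tests the right file.
    find({ wanted => sub { push @files, $File::Find::name if -f },
           no_chdir => 1 }, $root);

    my @buckets = map { [] } 1 .. $n;
    my $i = 0;
    push @{ $buckets[ $i++ % $n ] }, $_ for shuffle @files;
    return \@buckets;
}

# Hypothetical helper: write each bucket to "$prefix.NNN", one path per line.
sub write_lists {
    my ($buckets, $prefix) = @_;
    for my $k (0 .. $#$buckets) {
        my $path = sprintf '%s.%03d', $prefix, $k;
        open my $out, '>', $path or die "Cannot write $path: $!";
        print {$out} "$_\n" for @{ $buckets->[$k] };
        close $out or die "Cannot close $path: $!";
    }
    return;
}

# Example call (paths are placeholders):
#   write_lists(partition_files('/some/data/dir', 100), 'filelist');
```

This still holds the whole file list in memory, which is exactly the part Hadoop would take off your hands.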

jeffa

L-LL-L--L-LL-L--L-LL-L--
-R--R-RR-R--R-RR-R--R-RR
B--B--B--B--B--B--B--B--
H---H---H---H---H---H---
(the triplet paradiddle with high-hat)