"Let's face it. Your suggestion is a crock."
Again, you assume. You assumed i suggested Hadoop. I merely pointed out that "... [script] builds a big list in memory and then partitions the matching files into 100+ lists (1 per cluster instance) and writes the to separate files" is what Hadoop (not EC2) gives you for "free," as in you don't have to roll that wheel.
Do you have anything to say about taking credit for tilly's suggestion just now? When you said, and i quote, "I even suggested it here a couple of weeks ago."
Update: I suppose not.