Rerunning with Perl 5.16 yielded a runtime of 60 seconds.
Conclusion: With 384GB of RAM, your (relatively) tiny 10e6-line test file is being read from the system file cache, which effectively disguises the disk IO costs.
If your 80GB file fits in cache and will always be there when you need to do this, you can ignore the effects of disk.
Otherwise, you need to re-run all your tests using the real file, flushing the cache before each test.
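For what it's worth, here is a rough sketch of what "flush the cache before each test" could look like. It assumes a Linux box (the OS isn't stated in this thread) and root privileges, and relies on the kernel's /proc/sys/vm/drop_caches interface. The function names flush_cache and cold_run, and the DROP_CACHES override variable, are made up for illustration.

```shell
#!/bin/sh
# Sketch only: time a command against a cold file cache.
# Assumption: Linux, run as root. DROP_CACHES is a hypothetical override
# so the target path can be changed; by default it is the kernel interface.

flush_cache() {
    target="${DROP_CACHES:-/proc/sys/vm/drop_caches}"
    sync                        # write dirty pages out so they can be dropped
    if [ -w "$target" ]; then
        echo 3 > "$target"      # 3 = drop page cache plus dentries and inodes
    else
        echo "cannot write $target (not root, or not Linux?)" >&2
        return 1
    fi
}

cold_run() {
    # Refuse to run the command if the cache could not be flushed,
    # so a warm-cache timing is never mistaken for a cold one.
    flush_cache || return 1
    start=$(date +%s)
    "$@"
    status=$?
    end=$(date +%s)
    echo "elapsed: $((end - start))s" >&2
    return $status
}
```

You would then call something like `cold_run perl yourscript.pl yourfile.tsv` once per variant being compared (both names are placeholders), so every run starts from the same cold state. The timing is only good to whole seconds, which is plenty when runs take a minute or more.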
In reply to Re^9: selecting columns from a tab-separated-values file