Beefy Boxes and Bandwidth Generously Provided by pair Networks
The stupid question is the question not asked

Comment on

( #3333=superdoc: print w/replies, xml ) Need Help??
Mabooka % head -1 numbers.csv 462,393,252,996,663,603,344,439,139,259,879,766,545,192,477,986,317,77 +,611,303,79,742,190,556,538 Mabooka % wc -l numbers.csv 500000 numbers.csv Mabooka % Mabooka % time perl -lanF, -e 'print join ",", @F[0..4];' numbers.csv +> f1 27.820u 0.100s 0:27.92 100.0% 0+0k 0+0io 320pf+0w Mabooka % time cut -d, -f"1-5" numbers.csv > f2 1.860u 0.100s 0:01.96 100.0% 0+0k 0+0io 100pf+0w Mabooka % diff f1 f2 Mabooka %
So it's clear what to use (if it's a bottleneck problem rather than an academic disput).

Now, back to the original problem (sum up columns): on my system, for 500,000 it's negligible, so I tried with 5,000,000 x 25 cols:
Mabooka % time perl -nle'my($a,$b,$c,$d,$e)=split /,/;$ta+=$a, $tb+=$b +, $tc+=$c, $td+=$d, $te+=$e;END{print join " ", $ta,$tb,$tc,$td,$te}' + numbers.csv 2499084140 2499188390 2500073650 2497725180 2495867770 45.270u 0.200s 0:45.44 100.0% 0+0k 0+0io 322pf+0w Mabooka % Mabooka % time sum5.cut_n_awk 2499084140 2499188390 2500073650 2497725180 2495867770 18.520u 0.490s 0:12.52 151.8% 0+0k 0+0io 575pf+0w Mabooka %
, where:
Mabooka % cat sum5.cut_n_awk # cat numbers.csv | cut -f1,2,3,4,5 -d, |awk -F, '{s1 += $1; s2 += $2; s +3+= $3; s4+=$4; s5+=$5} END {printf ("%.0f %.0f %.0f %.0f %.0f\n", s +1, s2,s3,s4,s5)}'

3-4 times difference isn't bad. Maybe this would help...

In reply to Re: cut vs split (suggestions) by Mabooka-Mabooka
in thread cut vs split (suggestions) by sk

Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post; it's "PerlMonks-approved HTML":

  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.
  • Log In?

    What's my password?
    Create A New User
    and all is quiet...

    How do I use this? | Other CB clients
    Other Users?
    Others avoiding work at the Monastery: (5)
    As of 2018-06-23 04:54 GMT
    Find Nodes?
      Voting Booth?
      Should cpanminus be part of the standard Perl release?

      Results (125 votes). Check out past polls.