Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl: the Markov chain saw
 
PerlMonks  

Re^4: selecting columns from a tab-separated-values file

by ibm1620 (Hermit)
on Jan 23, 2013 at 20:25 UTC ( [id://1015017]=note: print w/replies, xml ) Need Help??


in reply to Re^3: selecting columns from a tab-separated-values file
in thread selecting columns from a tab-separated-values file

I'm back at work and have tested the two-process solution. It took 60 seconds to pass 10M (M=million) records. Then I pulled the logic for splitting and joining the records out of obuf and into ibuf (thus eliminating obuf) and ran the same test, and it ran in 62 seconds. (In both cases the output was to /dev/null.)<\p>

I reran the tests sending output to an actual file in the same directory as the input, and obtained exactly the same runtimes.

In ALL cases I observed the CPU of the process that was doing the split/join to peg at 100%.

So I have to conclude that disk I/O is negligible for this program, in my environment.

Replies are listed 'Best First'.
Re^5: selecting columns from a tab-separated-values file
by BrowserUk (Patriarch) on Jan 23, 2013 at 23:09 UTC
    It took 60 seconds to pass 10M (M=million) records.

    Hm. 10e6 in a minute suggests a total time for 1e9 of well under 2 hours.

    This post mentions a time of 5 hours. What changed?


    With the rise and rise of 'Social' network sites: 'Computers are making people easier to use everyday'
    Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
    "Science is about questioning the status quo. Questioning authority".
    In the absence of evidence, opinion is indistinguishable from prejudice.
      The number of fields being extracted, mainly. My example of three fields was just a simplified illustration of my question.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://1015017]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others taking refuge in the Monastery: (4)
As of 2024-03-19 08:30 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found