Beefy Boxes and Bandwidth Generously Provided by pair Networks
more useful options

Re^2: Searching array against hash

by drhicks (Novice)
on Aug 21, 2013 at 21:54 UTC ( #1050424=note: print w/replies, xml ) Need Help??

in reply to Re: Searching array against hash
in thread Searching array against hash

Wow thanks, only takes a couple seconds to finish! I had attempted to do the same thing, but could never get it working, and wasn't sure if/how it would actually increase the speed. Thanks again

Replies are listed 'Best First'.
Re^3: Searching array against hash
by davido (Archbishop) on Aug 21, 2013 at 22:10 UTC

    The "if/how" is this:

    Your first solution had an outer loop that runs 900,000 times. Inside that outer loop, there's an inner loop that runs 60,000 times. 60k * 900k is 54 billion total iterations inside the inner loop.

    The proposed solution created a hash of 60000 elements. Then your 900000 line file is read line by line. Inside of that loop that iterates 900000 times, there's a hash lookup, which is almost free. There are a total of 60000 iterations needed to build the hash, and 900000 iterations needed to test each line of the FASTA file. The amount of work being done is, therefore, 960000 iterations.

    Think of loops inside of loops as doing n*m amount of work, whereas loops followed by loops (no nesting) do n+m amount of work. Anytime you have the choice of an algorithm where the order of growth is the mathematical product of two large numbers, or an algorithm where the growth rate is the mathematical sum of the same two numbers, your sense of economic utility should be telling you that the latter will scale up better.


Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1050424]
and all is quiet...

How do I use this? | Other CB clients
Other Users?
Others browsing the Monastery: (4)
As of 2018-05-26 10:34 GMT
Find Nodes?
    Voting Booth?