Re^2: [OT] The statistics of hashing.

by roboticus (Chancellor)
on Apr 02, 2012 at 14:07 UTC

in reply to Re: [OT] The statistics of hashing.
in thread [OT] The statistics of hashing.

Oh, something I forgot to mention: I tried using a constant number of bits but varying the vector size/quantity to see how things scaled. In other words, I compared:

vec size   # vectors
  10,000   1
   5,000   2
   3,333   3
   2,500   4
   2,000   5

I found that more, smaller vectors work better until the number of samples matches the number of bits in the smaller vector. Plotting the functions:

    (1-exp(-x/1000))^10
    (1-exp(-x/2000))^5
    (1-exp(-x/2500))^4
    (1-exp(-x/3333))^3
    1-exp(-x/10000)

using a graphing calculator shows that's where the curves cross.
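For anyone without a graphing calculator handy, here is a rough Python sketch that evaluates the same functions numerically. It assumes each function gives the chance that a new item finds its bit already set in every vector after x samples; that reading, and the names false_positive_rate, CONFIGS and first_crossing, are illustrative choices and not something from the original test code.

```python
import math

def false_positive_rate(samples, vec_size, num_vectors):
    """Evaluate (1 - exp(-samples/vec_size)) ** num_vectors.

    Assumed meaning: probability that all num_vectors vectors already
    have the relevant bit set after `samples` insertions.
    """
    return (1.0 - math.exp(-samples / vec_size)) ** num_vectors

# (vec_size, num_vectors) pairs taken from the plotted functions.
CONFIGS = [(1000, 10), (2000, 5), (2500, 4), (3333, 3), (10000, 1)]

def first_crossing(config, baseline=(10000, 1), max_samples=20000):
    """Smallest integer sample count where `config` stops beating `baseline`.

    Returns None if no crossing is found up to max_samples.
    """
    for x in range(1, max_samples + 1):
        if false_positive_rate(x, *config) >= false_positive_rate(x, *baseline):
            return x
    return None

if __name__ == "__main__":
    # Print where each multi-vector setup meets the single 10,000-bit vector.
    for cfg in CONFIGS[:-1]:
        print(cfg, first_crossing(cfg))
```

Running it prints, for each multi-vector setup, the first sample count at which it is no longer better than the single 10,000-bit vector, which can be compared against the plotted curves.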


When your only tool is a hammer, all problems look like your thumb.
