in reply to Re: [OT] The statistics of hashing.
in thread [OT] The statistics of hashing.
Oh, something I forgot to mention: I tried using a constant number of bits but varying the vector size/quantity to see how things scaled. In other words, I compared:
|vec size||# vectors|
|10,000 || 1|
| 5,000|| 2|
| 3,333|| 3|
| 2,500|| 4|
| 2,000|| 5|
| 1,000||10 |
I found more smaller vectors works better until the number of samples matches the number if bits in the smaller vector. Plotting the functions:
using a graphing calculator shows that's where the curves cross:
When your only tool is a hammer, all problems look like your thumb.