|Pathologically Eclectic Rubbish Lister|
Re^2: [OT] The statistics of hashing. (birthday)by BrowserUk (Pope)
|on Apr 01, 2012 at 04:13 UTC||Need Help??|
By those values, the odds against not having seen a duplicate by the time you reached 100 million inserts are so low as to be a pretty damn good definition of 'impossible'.
And yet, empirically, none had been seen by the time I reached 779,967,210.
And after 1.5 billion inserts, that calculation suggests that the odds of finding a value that doesn't match would be minuscule, and the "possible dups" count should be growing at almost the same rate as the new inserts are being tested.
The reality is that I've only had 1323 collisions after 1.5 billion inserts. These are the last 8:
They are coming much more regularly now, but they are actually still within the bounds of usability.
With the rise and rise of 'Social' network sites: 'Computers are making people easier to use everyday'
Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
"Science is about questioning the status quo. Questioning authority".
In the absence of evidence, opinion is indistinguishable from prejudice.