http://www.perlmonks.org?node_id=1117945


in reply to Re^2: [OT] Stats problem
in thread [OT] Stats problem

If you're not convinced then why not build yourself a small monte carlo simulation and try it out?

Replies are listed 'Best First'.
Re^4: [OT] Stats problem
by BrowserUk (Patriarch) on Feb 27, 2015 at 11:48 UTC

    Atfer 5000 runs of 1/2 billion checks per run; the result is:

    ... Run: 2000 buks:231 stds:273 ... Run: 3000 buks:346 stds:393 ... Run: 4000 buks:466 stds:520 ... Run: 5000 buks:577 stds:663 Run: 5001 buks:578 stds:664 Run: 5002 buks:579 stds:664 Run: 5003 buks:579 stds:664 Run: 5004 buks:579 stds:665

    So the odds are:

    5000 * 536870912 = 2684354560000 total checks / false hits = odds of a false hit / 579 = 4636190949 offset / 665 = 4036623398 0xdeadbeef Expected odds = 4294967296

    Which mean you are right!

    However, as the statistics above reflect, and as I observed from several (very) short runs whilst sanity checking the code; the offset seems to beat the odds every time; whilst the fixed magic number seems to come a little shy of it every time.

    There are not enough observations and not a sufficiently big difference between them to conclude that this is anything other than expected variation. But it does seem consistent.

    I've started another (low priority) run with some sanity check code enabled that counts the occurrences of each random value seen. The extra code means it runs much more slowly; and the restrictions of my physical memory mean I've had to limit the counts to unsigned bytes; but by outputting when those counts rollover it should give a clear indication of whether all values are being generated, as 96 % of them should rollover within a few dozen runs of each other -- if my calculations are correct I should see the bulk of them at around 1024 runs mark.

    All of which goes to reinforce my long standing observation that -- for me -- statistics is the second most unintuitive thing -- after quantum mechanics -- that I know just-enough-to-be-dangerous about.

    At least with QM I'm in good company when it comes to finding it spooky :)


    With the rise and rise of 'Social' network sites: 'Computers are making people easier to use everyday'
    Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
    "Science is about questioning the status quo. Questioning authority". I'm with torvalds on this
    In the absence of evidence, opinion is indistinguishable from prejudice. Agile (and TDD) debunked

      Definitely unintuitive! Isn't quantum mechanics just statistics with extra difficulty and a physics PHD ? ;)

      At least with QM I'm in good company when it comes to finding it spooky :)

      I resemble that remark!

      -QM
      --
      Quantum Mechanics: The dreams stuff is made of

Re^4: [OT] Stats problem
by BrowserUk (Patriarch) on Feb 26, 2015 at 23:06 UTC
    build yourself a small monte carlo simulation

    Well, its running, but it'll need to run for a while to be statistically valid.

    In the meantime are you saying that the fact that the value at any given offset has to match both the value and the offset has no influence upon the chances of a false positive?

    The UK national lottery picks 6 balls from 49: 49!/(6!*(49-6)!) = 1:13,983,816 chance.

    And once all 6 balls are out of the machine; they reorder them by ascending value; so the result is always shown as B1 < B2 < B3 < B4 < B5 < B6.

    But if players had to match both the numbers and their draw order, it would be a lot harder. The odds would be 6! * 13,983,816 = 1:10,068,347,520.

    So, value and position: highly increased odds; but you're saying that's not a factor here?


    With the rise and rise of 'Social' network sites: 'Computers are making people easier to use everyday'
    Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
    "Science is about questioning the status quo. Questioning authority". I'm with torvalds on this
    In the absence of evidence, opinion is indistinguishable from prejudice. Agile (and TDD) debunked

      As I understand it the 2 events are completely independent. At the time the corruption occurs there a 100% percent chance that each cell has a fixed value, so it's just not relevant how that value was arrived at.