in reply to
(contest) Help analyze PM reputation statistics

If we consider the node reputation as a random variable, we can perform some interesting analyses. The entropy of the reputation random variable is 5.27, meaning that the theoretical lower bound for storing the information contained in the node reputations is 5.27 bits per post. So when someone says "node reputation isn't worth 2 bits," they're wrong -- it is actually worth at least 5.27 bits. ;)

`use List::Util 'sum';
my $sum = sum values %rep_stats;
my $entropy = sum map { -($_/$sum) * log($_/$sum) / log(2) }
values %rep_stats;
printf "Total entropy: %.05f\n", $entropy;
`

An interesting statistic would be whether the entropy of the reputation random variable is going up or down over time. Then we could say whether node reputation was becoming more or less meaningful.