[perl.git] branch blead, updated. v5.23.5-84-g7d9942e

Dave Mitchell Tue, 01 Dec 2015 06:00:46 -0800

In perl.git, the branch blead has been updated

<http://perl5.git.perl.org/perl.git/commitdiff/7d9942e329fa0ad09f22c706668103b6a0b73c7c?hp=4c298843f44226ef4a0089b71a623720a7aea3e6>


- Log -----------------------------------------------------------------
commit 7d9942e329fa0ad09f22c706668103b6a0b73c7c
Author: David Mitchell <da...@iabyn.com>
Date:   Tue Dec 1 12:32:34 2015 +0000

    op/rand.t: better test even spread of random nums
    
    The old test evaluated int(256*rand(1)) a large number of times, and
    calculated the average number of bits seen in the result. If this was too
    far from 4, it failed the (single) test. This is a rather crude test, and
    generated what may be false negatives quite often in smoke tests.
    
    This commit replaces that with a more comprehensive test scheme, but which
    should cause a false negative in the test script only once every 2 million
    runs, assuming a fair random number generator.
    
    As before, it calculates 256*rand(1) many times, but maintains a count of
    the number of occurrences of each result. Each count is then checked
    whether it is within 6 sigmas of the expected value. For example for
    100_000 iterations, we expect each count to be approximately 390, with a
    6-sigma range of 272..509. If any count is outside that range, it fails
    one of the 256 tests.
    
    Thus this script now does 256 tests rather than a single one, so is a lot
    better at detecting bad RNGs.
    
    With each test being 6-sigma (1 in 500e6 failures) and 256 tests, that
    gives us a false negative rate of approx 1 in every 2 million runs.

M       t/op/rand.t

commit 8b002d476a21e6f92b2a461d3255fe50587956ca
Author: David Mitchell <da...@iabyn.com>
Date:   Mon Nov 30 14:10:34 2015 +0000

    t/op/rand.t: modernise tests
    
    e.g. cmp_ok() rather than ok($x > 0), and give tests descriptions.

M       t/op/rand.t

commit f0bf11b219764f72d70d85e011ac6cd69cdea182
Author: David Mitchell <da...@iabyn.com>
Date:   Mon Nov 30 13:51:30 2015 +0000

    remove flawed tests from t/op/rand.t
    
    There is a section of code supposedly intended to check that
    d_randbits bits is sane. AFIKT, the code was flawed from the start,
    and has just become more broken since.
    
    Initially it ran rand(1) many times and recorded the min and max.
    It then (in a convoluted way involving logarithms) checked that
    $max wasn't greater than 1.
    
    I suspect that this code was (at least at one point during its initial
    writing) supposed to check that the number of bit of precision in rand()
    matched d_randbits. If that was the case, then it certainly wasn't doing
    that.
    
    Subsequently, extra bogus tests were added, e.g. that
        $max < (2 ** $randbits)
    which it always will be, since it should be < 1.
    
    Then the main test was inadvertently broken by a precedence issue
    involving '!': I don't think this does what you think it does:
    
        unless (ok( !$max <= 0 or $max >= (2 ** $randbits)))
    
    Then an extra check was added after each call to rand(1) in the loop that
    the result was in the range 0..1. This check made all the other checks
    that follow on $min and and $max superfluous.
    
    So this commit removes all those extra tests, and changes a couple
    of 'print #...\n"' into diag("..."), since we're now in the 21st century
    :-)

M       t/op/rand.t
-----------------------------------------------------------------------

Summary of changes:
 t/op/rand.t | 229 ++++++++++++++++--------------------------------------------
 1 file changed, 61 insertions(+), 168 deletions(-)

diff --git a/t/op/rand.t b/t/op/rand.t
index b3bfdd4..b3df32e 100644
--- a/t/op/rand.t
+++ b/t/op/rand.t
@@ -24,184 +24,79 @@ use strict;
 use Config;
 
 require "./test.pl";
-plan(tests => 10);
 
 
-my $reps = 15000;      # How many times to try rand each time.
+my $reps = 100_000;    # How many times to try rand each time.
                        # May be changed, but should be over 500.
                        # The more the better! (But slower.)
 
-sub bits ($) {
-    # Takes a small integer and returns the number of one-bits in it.
-    my $total;
-    my $bits = sprintf "%o", $_[0];
-    while (length $bits) {
-       $total += (0,1,1,2,1,2,2,3)[chop $bits];        # Oct to bits
-    }
-    $total;
-}
+my $bits = 8;  # how many significant bits we check on each random number
+my $nslots = (1<< $bits); # how many different numbers
+
+plan(tests => 7 + $nslots);
 
-# First, let's see whether randbits is set right
+# First, let's see whether randbits is set right and that rand() returns
+# an even distribution of values
 {
-    my($max, $min, $sum);      # Characteristics of rand
-    my($off, $shouldbe);       # Problems with randbits
-    my($dev, $bits);           # Number of one bits
-    my $randbits = $Config{randbits};
-    $max = $min = rand(1);
+    my $sum;
+    my @slots = (0) x $nslots;
+    my $prob = 1/$nslots;     # probability of a particular slot being
+                              # on a particular iteration
+
+    # We are going to generate $reps random numbers, each in the range
+    # 0..$nslots-1. They should be evenly distributed. We use @slots to
+    # count the number of occurrences of each number. For each count, we
+    # check that it is in the range we expect. For example for reps =
+    # 100_000 and using 8 bits, we expect each count to be around
+    # 100_000/256 = 390. How much around it we tolerate depends on the
+    # standard deviation, and how many deviations we allow. If we allow
+    # 6-sigmas, then that means that in only 1 run in 506e6 will be get a
+    # failure by chance, assuming a fair random number generator. Given
+    # that we test each slot, the overall chance of a false negative in
+    # this test script is about 1 in 2e6, assuming 256 slots.
+    #
+    # the actual count in a slot should follow a binomial distribution
+    # (e.g. rolling 18 dice, we 'expect' to see 3 sixes, but there's
+    # actually a 24% chance of 3, a 20% change of 2 or 4, a 12%
+    # chance of 1 or 5, and a 4% chance of 0 or 6 of them).
+    #
+    # This makes it easy to calculate the expected mean a standard
+    # deviation; see
+    #     https://en.wikipedia.org/wiki/Binomial_distribution#Variance
+
+    my $mean   = $reps * $prob;
+    my $stddev = sqrt($reps * $prob * (1 - $prob));
+    my $sigma6 = $stddev * 6.0; # very unlikely to be outside that range
+    my $min = $mean - $sigma6;
+    my $max = $mean + $sigma6;
+
+    note("reps=$reps; slots=$nslots; min=$min mean=$mean max=$max");
+
     for (1..$reps) {
        my $n = rand(1);
        if ($n < 0.0 or $n >= 1.0) {
-           print <<EOM;
-# WHOA THERE!  \$Config{drand01} is set to '$Config{drand01}',
-# but that apparently produces values < 0.0 or >= 1.0.
-# Make sure \$Config{drand01} is a valid expression in the
-# C-language, and produces values in the range [0.0,1.0).
-#
-# I give up.
+           diag(<<EOM);
+WHOA THERE!  \$Config{drand01} is set to '$Config{drand01}',
+but that apparently produces values ($n) < 0.0 or >= 1.0.
+Make sure \$Config{drand01} is a valid expression in the
+C-language, and produces values in the range [0.0,1.0).
+
+I give up.
 EOM
            exit;
        }
-       $sum += $n;
-       $bits += bits($n * 256);        # Don't be greedy; 8 is enough
-                   # It's too many if randbits is less than 8!
-                   # But that should never be the case... I hope.
-                   # Note: If you change this, you must adapt the
-                   # formula for absolute standard deviation, below.
-       $max = $n if $n > $max;
-       $min = $n if $n < $min;
-    }
-
-
-    # This test checks for one of Perl's most frequent
-    # mis-configurations. Your system's documentation
-    # for rand(2) should tell you what value you need
-    # for randbits. Usually the diagnostic message
-    # has the right value as well. Just fix it and
-    # recompile, and you'll usually be fine. (The main 
-    # reason that the diagnostic message might get the
-    # wrong value is that Config.pm is incorrect.)
-    #
-    unless (ok( !$max <= 0 or $max >= (2 ** $randbits))) {# Just in case...
-       print <<DIAG;
-# max=[$max] min=[$min]
-# This perl was compiled with randbits=$randbits
-# which is _way_ off. Or maybe your system rand is broken,
-# or your C compiler can't multiply, or maybe Martians
-# have taken over your computer. For starters, see about
-# trying a better value for randbits, probably smaller.
-DIAG
-
-       # If that isn't the problem, we'll have
-       # to put d_martians into Config.pm 
-       print "# Skipping remaining tests until randbits is fixed.\n";
-       exit;
-    }
-
-    $off = log($max) / log(2);                 # log2
-    $off = int($off) + ($off > 0);             # Next more positive int
-    unless (is( $off, 0 )) {
-       $shouldbe = $Config{randbits} + $off;
-       print "# max=[$max] min=[$min]\n";
-       print "# This perl was compiled with randbits=$randbits on $^O.\n";
-       print "# Consider using randbits=$shouldbe instead.\n";
-       # And skip the remaining tests; they would be pointless now.
-       print "# Skipping remaining tests until randbits is fixed.\n";
-       exit;
+        $slots[int($n * $nslots)]++;
     }
 
-
-    # This should always be true: 0 <= rand(1) < 1
-    # If this test is failing, something is seriously wrong,
-    # either in perl or your system's rand function.
-    #
-    unless (ok( !($min < 0 or $max >= 1) )) {  # Slightly redundant...
-       print "# min too low\n" if $min < 0;
-       print "# max too high\n" if $max >= 1;
+    for my $i (0..$nslots - 1) {
+        # this test should randomly fail very rarely. If it fails
+        # for you, try re-running this test script a few more times;
+        # if it goes away, it was likely a random (ha ha!) glitch.
+        # If you keep seeing failures, it means your random number
+        # generator is producing a very uneven spread of values.
+        ok($slots[$i] >= $min && $slots[$i] <= $max, "checking slot $i")
+            or diag("slot $i; count $slots[$i] outside expected range 
$min..$max");
     }
-
-
-    # This is just a crude test. The average number produced
-    # by rand should be about one-half. But once in a while
-    # it will be relatively far away. Note: This test will
-    # occasionally fail on a perfectly good system!
-    # See the hints for test 4 to see why.
-    #
-    $sum /= $reps;
-    unless (ok( !($sum < 0.4 or $sum > 0.6) )) {
-       print "# Average random number is far from 0.5\n";
-    }
-
-
-    #   NOTE NOTE NOTE NOTE NOTE NOTE NOTE NOTE NOTE NOTE
-    # This test will fail .006% of the time on a normal system.
-    #                          also
-    # This test asks you to see these hints 100% of the time!
-    #   NOTE NOTE NOTE NOTE NOTE NOTE NOTE NOTE NOTE NOTE
-    #
-    # There is probably no reason to be alarmed that
-    # something is wrong with your rand function. But,
-    # if you're curious or if you can't help being 
-    # alarmed, keep reading.
-    #
-    # This is a less-crude test than test 3. But it has
-    # the same basic flaw: Unusually distributed random
-    # values should occasionally appear in every good
-    # random number sequence. (If you flip a fair coin
-    # twenty times every day, you'll see it land all
-    # heads about one time in a million days, on the
-    # average. That might alarm you if you saw it happen
-    # on the first day!)
-    #
-    # So, if this test failed on you once, run it a dozen
-    # times. If it keeps failing, it's likely that your
-    # rand is bogus. If it keeps passing, it's likely
-    # that the one failure was bogus. If it's a mix,
-    # read on to see about how to interpret the tests.
-    #
-    # The number printed in square brackets is the
-    # standard deviation, a statistical measure
-    # of how unusual rand's behavior seemed. It should
-    # fall in these ranges with these *approximate*
-    # probabilities:
-    #
-    #          under 1         68.26% of the time
-    #          1-2             27.18% of the time
-    #          2-3              4.30% of the time
-    #          over 3           0.26% of the time
-    #
-    # If the numbers you see are not scattered approximately
-    # (not exactly!) like that table, check with your vendor
-    # to find out what's wrong with your rand. Or with this
-    # algorithm. :-)
-    #
-    # Calculating absolute standard deviation for number of bits set
-    # (eight bits per rep)
-    $dev = abs ($bits - $reps * 4) / sqrt($reps * 2);
-
-    ok( $dev < 4.0 );
-
-    if ($dev < 1.96) {
-       print "# Your rand seems fine. If this test failed\n";
-       print "# previously, you may want to run it again.\n";
-    } elsif ($dev < 2.575) {
-       print "# This is ok, but suspicious. But it will happen\n";
-       print "# one time out of 25, more or less.\n";
-       print "# You should run this test again to be sure.\n";
-    } elsif ($dev < 3.3) {
-       print "# This is very suspicious. It will happen only\n";
-       print "# about one time out of 100, more or less.\n";
-       print "# You should run this test again to be sure.\n";
-    } elsif ($dev < 3.9) {
-       print "# This is VERY suspicious. It will happen only\n";
-       print "# about one time out of 1000, more or less.\n";
-       print "# You should run this test again to be sure.\n";
-    } else {
-       print "# This is VERY VERY suspicious.\n";
-       print "# Your rand seems to be bogus.\n";
-    }
-    print "#\n# If you are having random number troubles,\n";
-    print "# see the hints within the test script for more\n";
-    printf "# information on why this might fail. [ %.3f ]\n", $dev;
 }
 
 
@@ -219,11 +114,9 @@ DIAG
     # within the range 0 - 100, and that the numbers produced
     # have a reasonably-large range among them.
     #
-    unless ( ok( !($min < 0 or $max >= 100 or ($max - $min) < 65) ) ) {
-       print "# min too low\n" if $min < 0;
-       print "# max too high\n" if $max >= 100;
-       print "# range too narrow\n" if ($max - $min) < 65;
-    }
+    cmp_ok($min,        '>=',  0, "rand(100) >= 0");
+    cmp_ok($max,        '<', 100, "rand(100) < 100");
+    cmp_ok($max - $min, '>=', 65, "rand(100) in 65 range");
 
 
     # This test checks that rand without an argument
@@ -239,7 +132,7 @@ DIAG
     # This checks that rand without an argument is not
     # rand($_). (In case somebody got overzealous.)
     # 
-    ok($r < 1,        'rand() without args is under 1');
+    cmp_ok($r, '<', 1,   'rand() without args is under 1');
 }
 
 { # [perl #115928] use a standard rand() implementation

--
Perl5 Master Repository

[perl.git] branch blead, updated. v5.23.5-84-g7d9942e

Reply via email to