[jira] [Commented] (RNG-181) LXM generators to implement SplittableUniformRandomProvider

Alex Herbert (Jira) Thu, 15 Sep 2022 03:41:05 -0700


    [ 
https://issues.apache.org/jira/browse/RNG-181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17605255#comment-17605255
 ]


Alex Herbert commented on RNG-181:
----------------------------------

I have completed an initial version for review ([PR 
117|https://github.com/apache/commons-rng/pull/117]).
h2. Seed generation to mix with a stream position
{noformat}
rngSeed = ((seed << shift) | position){noformat}
The seed generation routine for the stream methods uses a character size of 4. 
The seed is tested to be uniformly distributed among the 8 odd characters for 
the least significant digit. Each digit is tested to have a uniform 
distribution of the other 15 possible characters in the remaining digits of the 
seed.

This routine has been updated to avoid the use of {{nextInt(n)}} when replacing 
duplication of the least significant unique character. Any algorithm for 
{{nextInt(n)}} must reject some samples to ensure uniformity. This is typically 
performed by taking the output of {{nextInt}} (assumed to be uniform) and using 
only the the uniform range that is a multiple of n. Any part of the range left 
over is rejected. This range will be less than n in size.

One rejection algorithm uses:
{code:java}
// x = floor(n * fraction)
//   = floor(n * [0, 2^32) / 2^32)
int x = (n * Integer.toUnsignedLong(nextInt())) >> 32;
{code}
When n=15 the remainder (2^32 % 15) == 1. Thus the value almost exact divides 
into 2^32. Only one value output from nextInt() must be rejected to create a 
uniform sample from the remaining range [0, 2^32 - 1). The rejection rate is 
therefore 2^-32 and the bias in the sample is negligible. The algorithm can 
thus be updated to detect duplicate occurrences of the least significant digit 
and replace them using a branchless routine using 4 bytes of randomness 
(nextInt()) per character. Given the rejection rate of 1/16 for 15 characters 
the expected number of bytes used to generate an 8-byte seed is approximately 
12.
h2. Stream support

The implementation uses a static helper class to create a stream. The helper 
can be used from different packages for both the IntProvider and LongProvider 
classes. This exposes the following public API:
{code:java}
public final class RandomStreams {

    /**
     * A factory for creating objects using a seed and a using a source of 
randomness.
     *
     * @param <T> the object type
     */
    public interface ObjectFactory<T> {
        /**
         * Creates the object.
         *
         * @param seed Seed used to initialise the instance.
         * @param source Source of randomness used to initialise the instance.
         * @return the object
         */
        T create(long seed, UniformRandomProvider source);
    }

    /**
     * Returns a stream producing the given {@code streamSize} number of new 
objects
     * generated using the supplied {@code source} of randomness using the 
{@code factory}.
     *
     * <p>A {@code long} seed is provided for each object instance using the 
stream position
     * and random bits created from the supplied {@code source}.
     */
    public static <T> Stream<T> generateWithSeed(long streamSize,
                                                 
SplittableUniformRandomProvider source,
                                                 ObjectFactory<T> factory);
}

// Used as:
public class L64X128Mix implements SplittableUniformRandomProvider {

    @Override
    public Stream<SplittableUniformRandomProvider> splits(long streamSize, 
SplittableUniformRandomProvider source) {
        return RandomStreams.generateWithSeed(streamSize, source, 
L64X128Mix::create);
    }

    private static SplittableUniformRandomProvider create(long seed, 
UniformRandomProvider source) {
        // Generate random state ...

        return new L64X128Mix(s0, s1, x0, x1);
    }
}{code}
The alternative to a public utility class is:
 # To place this functionality in the BaseProvider. However since the 
IntProvider/LongProvider reside in a different package the functionality would 
have to be exposed as protected.
 # To place the functionality within the same package as the Int/LongProviders, 
this would create no public API changes but require code duplication.

Note: Creating a new abstract base class for splittable generators would also 
require code duplication. Currently the IntProvider and LongProvider extend 
BaseProvider from a different package. These providers add the caching 
functionality for some generator methods and support the save/restore state 
functionality. If splittable generators extend a new AbstractSplittableProvider 
they would have to reimplement the cache functionality. Exposing the 
functionality through either a protected/public method is the solution with 
minimal code duplication. I opted for the public utility class as the 
functionality is then distinct from the core provider functionality in the 
BaseProvider. It may also be used to generate streams of other objects types 
that can be created with a seed and source of randomness. However I do not 
think this can be applied directly to streams of SharedStateSamplers as they 
create new instances using only a new source of randomness and the current API 
cannot easily exploit the provided unique seed.
h2. Avoiding a XBG zero state

The LXM generators require a non-zero state for the xor-based generator (XBG). 
This is typically handled by creating them using the seeding routines in the 
{{commons-rng-simple}} package. If this occurs during a split then it must be 
corrected. A SplitMix style generator is used to create the state for the XBG 
state. This has been done by reusing the mix function from the LXM generator to 
mix a Weyl sequence created using the golden ratio. This avoids exposing 
further API constants/methods outside the packages. The alternative to reuse 
the mix functions and constants in the BaseProvider would require changing them 
from private to protected. Since the BaseProvider seeding routine is not 
subject to behaviour compatibility between versions minimising exposure of the 
internals is more appropriate. The chance of a zero state for the XBG from a 
random seeding is at least 2^-128 (two longs of zero) or 2.9e-39. This 
functionality is an extremely rare edge case and the details are not 
significant to the generator behaviour. Note the unit tests do create situation 
this by passing in a faulty source of randomness for the split and testing the 
output does not match a new generator seeded with zeros.
h2. Split implementation

The splitting routine has been implemented in each LXM class. The alternative 
for the LongProviders could have used a parent abstract class with a seeding 
routine based on a {{{}long[]{}}}. Since the smaller state providers have a 
constructor accepting primitive values this would add object creation overhead 
for the array for each split. Since the hot path for the split is simply to 
create N new long values for the seed and return a new instance then the small 
amount of repetitive code keeps all split code within the class and avoids the 
complexity of adding to the class hierarchy. Note this is different from the 
implementation of the jump functions in an abstract parent class. In that case 
the jump function is exactly the same implementation as jumping only requires 
advancing the linear congruential generator (LCG) using a specialised 
multiply-add routine.
h2. API Compatibility

Use of the splittable interface by other modules requires an exception in 
RevAPI for exposing an external class in the API. This is similar to exposure 
of UniformRandomProvider and is allowed. The change is non-breaking for binary 
and source compatibility (see revapi 
[java.class.externalClassExposedInAPI|https://revapi.org/revapi-java/0.22.1/differences.html#_external_class_exposed_in_api_java_class_externalclassexposedinapi]).

> LXM generators to implement SplittableUniformRandomProvider
> -----------------------------------------------------------
>
>                 Key: RNG-181
>                 URL: https://issues.apache.org/jira/browse/RNG-181
>             Project: Commons RNG
>          Issue Type: New Feature
>          Components: core
>    Affects Versions: 1.5
>            Reporter: Alex Herbert
>            Assignee: Alex Herbert
>            Priority: Major
>             Fix For: 1.5
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> The LXM family of generators use a linear congruential generator (LCG) mixed 
> with a xor-based generator. The paper introducing these generators indicates 
> that these are suitable for splitting as the streams are demonstrated to be 
> independent when the addition parameter for the LCG is different. The authors 
> provide results for outputting the combined (interleaved) sequence from up to 
> 2^24 generators with different LCG add parameters through Test U01 BigCrush 
> with no failures.
> These generators are splittable in the JDK 17 java.util.random package. 
> Splitting functionality should be added to the family in Commons RNG.
> A single split of the generator is made by randomly seeding a new generator 
> using the existing one. This is no different than simply creating a new 
> generator from any source of randomness.
> When the generator is used to make a Stream<SplittableUniformRandomProvider> 
> then each stream instance can be created using a different addition 
> parameter. This can be done by exploiting the fact that the spliterator for 
> the stream maintains the stream position as a count in [0, streamSize). The 
> addition parameter can be created by using this count. To ensure a second 
> stream will have different seeding the count can be combined with random bits 
> that change for each stream instance.
> h2. JDK Implementation
> The exact mechanism used in the JDK is not documented (in the public javadoc) 
> so may be subject to change. The present source code adds the stream position 
> bits to random bits to create the addition. The seed random bits are 
> generated with the some of the lowest bits set to 1. The addition parameter 
> is created by shifting the seed random addition left until the lowest 1-bit 
> is above the highest 1-bit of the position. The two can be combined (with 
> xor) to create an addition which will be unique among the stream. 
> The present implementation in the JDK creates the initial seed as a series of 
> n-bit characters using a seed generation algorithm; the lowest n-bits are set 
> to 1 to act as a marker for the end of the seed. The algorithm to create the 
> initial seed characters uses the overflow from a multiply congruential 
> generator (MCG) with m=15 to create the digits as 4-bits. The source code 
> indicates that the character size could be any supported number in [2, 63]. 
> However the more bits that are set to mark the lower position of the seed 
> impacts the amount of randomness left in the remaining bits.
> The initial seed characters are created using Math.multiplyHigh which is 
> signed. The result is that the digits in the seed have a non-uniform 
> distribution as the overflow character is subject to sign extension 
> correction in the multiply. If the same algorithm uses JDK 18s 
> Math.unsignedMultiplyHigh then the digits have a uniform distribution.
> Since the intrinsic functions Math.multiplyHigh and Math.unsignedMultiplyHigh 
> are not available in JDK 8 then a similar approach using a MCG would be slow.
> Details to take away are that the counter and initial random seed are 
> combined. Overlap of the seed and the counter are avoided by shifting the 
> seed left. Some bits are used as a marker in the seed which reduces the 
> randomness. Ideally the initial seed should have randomly distributed bits; 
> this is not the case in the JDK 17 implementation due to the generation 
> algorithm used.
> h2. Proposed Implementation
> To create a random addition combine the stream position with random bits.
> Notes:
>  # The stream position will be a positive long. However the addition for the 
> LCG must be odd. This is done in the LXM family by setting the lowest 1 bit 
> to 1. Thus the count must be shifted left 1-bit to avoid duplications in the 
> seed sequence. This removes the ability to add extra bits to seed additions 
> when the stream size exceeds 2^63. Such large streams are not expected in 
> common use.
>  # A counter that is enumerated from 0 to 2^n - 1 will be uniformly 
> distributed for each bit in the first n-bits. However when used as an 
> addition to a sum in an LCG all of these possible additions may not trickle 
> up changes very far into higher bits. 1-bits must be present in higher 
> positions of the addition to effect higher bits in the sum. If all the higher 
> bits are the same then the trickle up effect will be similar. So it is an 
> advantage to have the seed counter combined with different random bits.
> A random seed could be xor'd with the counter. However the upper bits for all 
> additions would be the same above the highest 1-bit in the sequence size. 
> This reduces the variation in the addition parameter for use by the LXM 
> generator.
> Thus is makes sense to create a random seed, then shift it left as the 
> counter increases in magnitude. Note that the shift to avoid overlap with the 
> counter could be maintained using a separate shift counter. However if the 
> initial random seed bits have many zeros in the low bits it may be possible 
> to left shift this and have nothing to combine with a large value counter. 
> This may lead to duplications of previous ((seed << shift) | count) values as 
> the counter increases in magnitude. If there is always at least 1 bit to 
> prepend to the counter then no duplications will occur as any larger counts 
> that match a previous ((seed << shift) | count) will also have at least 1 bit 
> prepended. This limits the seed addition to 63-bits of randomness.
> Since the lowest 1-bit is enabled for the LCG add parameter this leaves 
> 62-bits of randomness to seed the addition parameter. A stream with more than 
> 2^62 elements would not be able to ensure a unique addition parameter for 
> each RNG in the stream. Such streams are unlikely to be used. A more 
> realistic parallel stream size of less than 1024 generators would only 
> consume 9 bits for the counter and the seed addition will contain at least 53 
> bits of randomness accounting for the two bits lost to the marker bit and the 
> odd LCG add parameter.
> h2. Code changes
> This functionality can be introduced in a new abstract base class for 
> splittable generators that can split using a seed value. The seed value would 
> be ensured to be unique within a stream (subject to the size limitations 
> above). The current LXM generators extend either IntProvider or LongProvider. 
> The LongProviders all use a 64 or 128-bit LCG and can thus use the entire 
> seed value when splitting. The IntProvider is the L32X64Mix generator which 
> uses a 32-bit LCG. This can only use 31-bits from the seed for the LCG add 
> parameter. It limits the number of unique add parameters within a stream to 
> 2^30 before duplications may be observed. This generator is splittable in the 
> JDK. It is used as the default generator in JDK 17 for 
> RandomGenerator.getDefault(). Supporting splitting in Commons RNG provides 
> parity with the modern JDK random implementation.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Commented] (RNG-181) LXM generators to implement SplittableUniformRandomProvider

Reply via email to