Re: Normal/Gaussian random number generation for D

jerro Tue, 06 Nov 2012 10:15:32 -0800

I don't see any downside to this.
Which "this" do you mean? My current approach, or the addingof an extra separate function? :-)


Your current approach.

I have only been thinking about the Ziggurat algorithm, butyou are right, itdoes depend on the details of the technique. For Box-Muller(and other enginesthat cache samples) it only makes sense to compute the firstsamples in theopCall. But for the Ziggurat algorithm, tables that must becomputed before youcan start sampling aren't changed during sampling andcomputing the tablesdoesn't require any additional arguments. So it makes the mostsense for thosetables to be initialized in the struct's constructor in thestruct based API.

So we should assume by default then that the struct'sconstructor should take an RNG as input, to enable it tocalculate these first values if it needs to?

No, I think that if the engine defines initialize() function(with no parameters), it should be called in the constructor ofNormal. I don't think the constructor of Normal should take anRNG as input. I think what you currently do in normal.d is fine,I would just add something like this:


static if(is(typeof(_engine.initialize())))
    _engine.initialize();

to Normal's constructor. Then ZigguratEngine can defineinitialize() and do its initialization there (it doesn't need auniform RNG for initialization) and NormalBoxMullerEngine canremain unchanged.

In my previous post I was talking about initializing staticinstances of theengine used in the normal() function. The advantage ofinitializing in a staticconstructor is that you don't need an additional check everytime the normal()function is called. But because we will also have a structbased API, that willnot require such checks (at least not for all engines), thisisn't really thatimportant. So we can also initialize the global engineinstance in a call to
normal(), if this simplifies things.

I guess my feeling here is that the values generated by an RNGshould depend on when it is called, and not at all on when itis instantiated.
i.e. if I do something like

    auto nrng = Normal!()(0, 1);
    writeln( uniform(0.0, 1.0) );
    writeln( uniform(0.0, 1.0) );
    writeln( nrng() );
    writeln( nrng() );

I should get the same output as if I do,

    writeln( uniform(0.0, 1.0) );
    writeln( uniform(0.0, 1.0) );
    auto nrng = Normal!()(0, 1);
    writeln( nrng() );
    writeln( nrng() );

You can also think that if I change from e.g.

    auto nrng = Normal!(real, Engine1)(0, 1);
    writeln( uniform(0.0, 1.0) );
    writeln( uniform(0.0, 1.0) );
    writeln( nrng() );
    writeln( nrng() );

to

    auto nrng = Normal!(real, Engine2)(0, 1);
    writeln( uniform(0.0, 1.0) );
    writeln( uniform(0.0, 1.0) );
    writeln( nrng() );
    writeln( nrng() );
... then I would expect to see different results from thenormal RNG but identical results from uniform(). If theconstructor of the normal engine calls the RNG, the uniform()results will change, no?

I was only talking about the part of initialization that doesn'tuse a RNG. I agree that everything that uses a RNG should be donein opCall (or inside a normal() function in the functioninterface). For Box-Muller, I think the approach you currentlyuse in NormalBoxMullerEngine is the most reasonable one.But a Ziggurat engines needs to compute some tables before it canstart generating samples. It doesn't need a RNG to do that andthe tables do not change after initialization.

I think it's obvious that that initialization that doesn't need aRNG should be done in Normal's constructor for the structinterface. What is not so obvious is where the initialization ofstatic data that doesn't require a RNG should be done forfunction interface. That initialization can be done in normal()function, or it can be done in a static constructor. If you do itin normal(), you need to do an extra check on each call tonormal(). This isn't really a problem as long as the struct'sopCall and the version of normal() that takes the engine as aparameter don't do such redundant checks. Then the users thatcare about the difference in performance can just use one ofthese interfaces.

Yes, a random.d test suite probably should be another project.
Regardless of tests, let's focus for now on getting the APIright for this case ofnon-uniform-random-number-generator-with-internal-engine, withnormal and exponential as the initial cases.

In general I like the API in file normal.d attached to youroriginal post. I think the engines should have an option to dosome initialization in Normal's constructor, though. We couldachieve that by calling _engine.initialize in Normal'sconstructors, if such method exists. This method would also needto be called on the static instance of normal engine used in thenormal() function. We could add something like this to the firstversion of normal:


static if(is(typeof(engine.initialize())))
{
    static bool isInitialized;
    if(!isInitialized)
        engine.initialize();
}

Another option would be to do this:

struct GlobalEngine(Engine)
{
    Engine instance;

    static this()
    {
        instance.initialize();
    }
}

And then inside the version of normal that doesn't take an engineas a parameter:


alias GlobalEngine!NormalRandomNumberEngine E;
return normal(mean, sigma, E.instance, urng);

The users would need to construct their own instance of engine touse the function that takes engine as a parameter. So it wouldmake sense to add helper functions for creating engine instances.

There's one change that I think would make the API moreconvenient. Normal struct and the engine don't store an instanceof a RNG , so they don't need to take it as a template parameter.We could make opCall methods templates instead. That way theusers would never need to explicitly specify the type of the RNG.

Re: Normal/Gaussian random number generation for D

Reply via email to