Re: std.random2

ixid Tue, 08 Jan 2013 10:40:26 -0800

On Tuesday, 8 January 2013 at 17:52:24 UTC, Joseph RushtonWakeling wrote:

Hello all,
Following discussion on the pull request adding normal randomnumber generation to std.random [https://github.com/D-Programming-Language/phobos/pull/1029 ],some issues were raised which are best discussed with the wholecommunity.
The heart of this is the design of pseudo-random numbergenerators (PRNGs) in Phobos. Currently these are implementedas value types, which has a number of unpleasant effects:
-- They're expensive to pass around (some PRNGs have a sizeof a MB or more)
-- Passing by value is statistically unsafe, as it canresult in identicalrandom sequences being generated in different parts ofthe code. Thisalready affects at least one part of std.random itself:see,
      http://d.puremagic.com/issues/show_bug.cgi?id=8247
http://forum.dlang.org/thread/[email protected]
-- Simply passing by reference isn't an adequate solution,as there will becases (as with RandomSample, detailed in bug 8247) whereyou have tostore the RNG. Storing e.g. a pointer or reference wouldbe unsafe; theonly adequate solution is that PRNGs be (safe) referencetypes.
monarch_dodra did some work on this which was set aside in theshort term because it would (unavoidably) be a breaking change[ https://github.com/D-Programming-Language/phobos/pull/893 ].To avoid this, the proposed solution is to create a std.random2.
However, these issues seem to me to have broader implicationsfor the design of random-number functionality in Phobos, so ifwe're going to do a std.random2, it's worth mapping them out soas to get them right.
The most obvious (to me) is that these issues which apply toPRNGs apply equally well to random number distributions. Forexample the Ziggurat algorithm requires storing several hundredconstants, so passing by value is expensive; and severaldifferent algorithms generate and store multiple randomvariates at a time, so copying/passing by value will result inunintended correlations in sequences of variates.
This has further implications if for example we want to createa VariateGenerator (or perhaps in D-ish terms, a VariateRange)which couples a random distribution with a PRNG -- this isunlikely to work unless both the random distribution and thePRNG are reference types.
Finally, there are more general issues about how newfunctionality should be implemented. C++11 is given as a modelin the std.random documentation, but this is clearly a guiderather than something to copy blindly -- existing functions andstructs already deviate from it in ways that reflect D'spreference for ranges and its superior generics. We need aclear picture of how to do this for functionality that has notyet been implemented.
For example: in the case of random distributions, the currentexample of uniform() offers only a function interface andtherefore little guidance about how to create structimplementations for more complex algorithms which requirepersistent storage (e.g. Ziggurat or Box-Muller). Should theyfollow C++11/Boost.Random in returning variates via opCall, orshould they be coupled with a PRNG at construction time (aswith RandomSample) and implement a range of variates?
My inclination here is to take some time to map out thedifferent interface/design options and to present the choicesto the community for review as a precursor to creating astd.random2. It seems the only really sensible choice toensure that we get a good future-proof design.
What does everyone think?

Best wishes,

    -- Joe

I imagine there has been some detailed discussion of thestd.nameX idea of libraries so forgive me if this has beendiscussed. Using this as an approach to essentially replacinglibraries instead of the depreciation route would seem topotentially lead to a situation where you have std.name1, 2, 3, 4ad infinitum which may not have exactly the same feature set oruse cases and would lead to people hunting around multiplelibraries which on the face of it are for the same thing andusing them in projects. Would it not be better to bite the bulletand depreciate? random2 simply sounds like random done properly,with random and random2 people would often use random, and sostill suffer the consequences of its issues, and possiblyoverlook random2. You're depreciating because it's broken, notbecause people want to mess around with trivialities.

Re: std.random2

Reply via email to