Re: hap.random: a new random number library for D

Joseph Rushton Wakeling via Digitalmars-d-announce Tue, 10 Jun 2014 23:46:30 -0700

On Tuesday, 10 June 2014 at 23:08:33 UTC, Chris Cain wrote:

I had an opportunity to give the entire code a good once-over read and I have a few comments.


Thanks! :-)

1. Biggest thing about the new hap.random is how much nicer it is to actually READ. The first few times I went through the current std.random, I remember basically running out of breath. hap.random was almost a refreshing read, in contrast. I'm guessing it has a lot to do with breaking it down into smaller, more manageable pieces. Regardless, good work on that. I suspect it'll make it easier to contribute to in the future.

That's great to hear, as it was a design goal. I think there will probably at some point be a need to separate things further (e.g. std.random.generator will probably have to be separated as will std.random.distribution) but always keeping the principle of implementing packages to make it possible to just "import hap.random" (or "import hap.random.generator", or whatever).

2. Something I'd really like to see is for the seed-by-range functions to take the range by reference instead of by value to ensure that the seed values used are less likely to be used in another RNG inadvertently later. Basically, I envision a similar problem with seedRanges as we currently have with RNGs where we have to make sure people are careful with what they do with the ranges in the end. This should cover use cases where users do things like `blah.seed(myEntropyRange.take(3))` as well, so that might take some investigation to figure out how realistic it would be to support.

Yea, that's an interesting point. I mean, you'd hope that myEntropyRange would be a reference type anyway, but every little helps :-)
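The by-value hazard from point 2 can be illustrated with a minimal sketch. `Counter` is a hypothetical stand-in for a value-type entropy range and is not part of hap.random; `take`, `array` and `refRange` are the standard Phobos functions.

```d
import std.array : array;
import std.range : refRange, take;

// Hypothetical value-type entropy source: a struct, so every copy
// carries its own independent position.
struct Counter
{
    uint n;
    enum bool empty = false;
    @property uint front() const { return n; }
    void popFront() { ++n; }
}

void main()
{
    Counter entropy;

    // By value: take() consumes a copy, the original never advances,
    // and two "seeds" end up with identical values.
    auto a = entropy.take(3).array;
    auto b = entropy.take(3).array;
    assert(a == [0u, 1u, 2u] && b == [0u, 1u, 2u]);

    // By reference: refRange wraps a pointer to the original, so the
    // consumption is visible and the second seed gets fresh values.
    auto c = refRange(&entropy).take(3).array;
    auto d = refRange(&entropy).take(3).array;
    assert(c == [0u, 1u, 2u] && d == [3u, 4u, 5u]);
}
```

Whether a seed function could apply something like `refRange` on the caller's behalf, including for the `take(3)` case, is exactly the open question raised above; this sketch only shows the caller-side workaround.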

3. I'd also REALLY like to see seed support ranges/values giving ANY type of integer and guarantee that few bytes are wasted (so, if it supplies 64-bit ints and the generator's internal state array only accepts 32-bit ints, it should spread the 64-bit int across two cells in the array). I have working code in another language that does this, and I wouldn't mind porting it to D for the standard library. I think this would greatly simplify the seeding process in user code (since they wouldn't have to care what the internal representation of the Random state is, then).


That would be very cool.  Can you point me at your code examples?
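As a rough illustration of point 3 (not Chris's code, which is not shown in this thread), one way to spread wide seed words across narrower state cells might look like the sketch below. The name `fillState` and the low-order-chunk-first ordering are assumptions made for this example only.

```d
import std.range.primitives : ElementType, empty, front, isInputRange, popFront;

/// Hypothetical helper: fills `state` from `source`, splitting each source
/// word into as many Cell-sized chunks as it holds, low-order chunk first.
/// Returns the number of cells written. Assumes unsigned integer types and
/// a source word at least as wide as a cell.
size_t fillState(Cell, R)(Cell[] state, ref R source)
    if (isInputRange!R && ElementType!R.sizeof >= Cell.sizeof)
{
    alias Word = ElementType!R;
    enum size_t chunksPerWord = Word.sizeof / Cell.sizeof;

    size_t i = 0;
    while (i < state.length && !source.empty)
    {
        Word w = source.front;
        source.popFront();
        foreach (k; 0 .. chunksPerWord)
        {
            if (i == state.length)
                break; // state is full; leftover chunks of w are dropped
            state[i++] = cast(Cell) (w >> cast(uint) (k * Cell.sizeof * 8));
        }
    }
    return i;
}

void main()
{
    ulong[] seeds = [0x1111_2222_3333_4444UL, 0x5555_6666_7777_8888UL];
    uint[4] state;

    // Two 64-bit words fill four 32-bit cells; nothing is wasted.
    assert(fillState(state[], seeds) == 4);
    assert(state == [0x3333_4444u, 0x1111_2222u, 0x7777_8888u, 0x5555_6666u]);
    assert(seeds.empty); // taken by ref, so the caller sees the consumption
}
```

The source is taken by `ref` here so that the consumed words are not silently reusable, in line with point 2. A source narrower than the cell type (e.g. bytes into 32-bit cells) would need the opposite, packing direction and is not covered by this sketch.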

4. I'd just like to say the idea of using ranges for seeds gets me giddy because I could totally see a range that queries https://random.org for true random bits to seed with, wrapped by a range that zeroes out the memory on popFront. Convenient and safe (possibly? Needs review before I get excited, obviously) for crypto purposes!

The paranoiac in me feels that anything that involves getting random data via HTTPS is probably insecure crypto-wise :-) However, I think sourcing random.org is a perfect case for an entry in hap.random.device. I think the best thing to do would probably be to offer a RandomOrgClient (which offers a very thin API around the random.org HTTP API) and then to wrap that in a range type that uses the client internally to generate random numbers with particular properties.

5. Another possible improvement would be something akin to a "remix" function. It should work identically to reseeding, but instead of setting the internal state to match the seed (as I see in https://github.com/WebDrake/hap/blob/master/source/hap/random/generator.d#L485), remixing should probably be XOR'd into the current state. That way if you have a state based on some real entropy, you can slowly, over time, drip in more entropy into the state.

Also a very interesting suggestion. Is there a standard name forthis kind of procedure?
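A minimal sketch of what such a "remix" could look like on a plain state array follows. The function name and signature are hypothetical, and wrapping around when there is more entropy than state is one arbitrary choice among several.

```d
/// Hypothetical `remix`: XORs fresh entropy into an existing state array
/// instead of overwriting it as a reseed would. Entropy longer than the
/// state wraps around and is folded into the earlier cells.
void remix(uint[] state, const(uint)[] entropy)
{
    assert(state.length > 0, "cannot remix into an empty state");
    foreach (i, e; entropy)
        state[i % state.length] ^= e;
}

void main()
{
    uint[] state = [0xDEADBEEF, 0x12345678];

    // One word of new entropy only touches the first cell.
    remix(state, [0xFFFFFFFF]);
    assert(state == [0x21524110u, 0x12345678u]);

    // XOR is its own inverse, so mixing the same word again restores it.
    remix(state, [0xFFFFFFFF]);
    assert(state == [0xDEADBEEFu, 0x12345678u]);
}
```

A real implementation would also need to guard against generator-specific invalid states, for example the all-zero state that Xorshift generators must avoid, since an unlucky XOR could in principle produce one.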

6. I'd like to see about supporting xorshift1024 as well (described here: http://xorshift.di.unimi.it/ and it's public domain code, so very convenient to port ... I'd do it too, of course, if that seems like an okay idea). This is a really small thing because xorshift1024 isn't really much better than xorshift128 (but some people might like the idea of it having significantly longer period).

Fantastic, I will see about implementing those. I wasn't previously aware of that work, but I _was_ aware that the standard Xorshift generators have some statistical flaws, so it's great to have some alternatives. It should be straightforward to implement things like XorshiftP128 or XorshiftS1024 and XorshiftS4096 (using P and S in place of + and *).

With these in place we might even be able to deprecate the old Xorshift generators.
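To give a feel for how small such a port would be, here is a sketch of xorshift128+ as an infinite input range. It is not hap.random code: the struct layout is an assumption, and the shift constants (23, 17, 26) are those of the public-domain reference C code as published in 2014, so they should be checked against that source before any real use.

```d
/// Sketch of xorshift128+ as an infinite input range of ulong.
/// Named after the XorshiftP128 spelling proposed above; the layout is
/// illustrative only.
struct XorshiftP128
{
    private ulong[2] s;
    private ulong current;

    /// At least one of the two seed words must be non-zero.
    this(ulong s0, ulong s1)
    {
        assert(s0 != 0 || s1 != 0, "state must not be all zero");
        s = [s0, s1];
        popFront(); // prime `front` with the first variate
    }

    enum bool empty = false;

    @property ulong front() const { return current; }

    void popFront()
    {
        ulong s1 = s[0];
        immutable ulong s0 = s[1];
        s[0] = s0;
        s1 ^= s1 << 23;
        s[1] = s1 ^ s0 ^ (s1 >> 17) ^ (s0 >> 26);
        current = s[1] + s0; // the "+" in xorshift128+
    }
}

void main()
{
    auto a = XorshiftP128(1, 2);
    auto b = XorshiftP128(1, 2);

    // Same seed, same sequence.
    foreach (_; 0 .. 4)
    {
        assert(a.front == b.front);
        a.popFront();
        b.popFront();
    }
}
```

The struct is a value type, so it has the same copy-by-value pitfalls discussed for std.random; a library version would presumably follow whatever reference-type convention hap.random uses for its other generators.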

Just for clarity, here's how I see things rolling out for the future:

  * First goal is to ensure the existing codebase "plays nice" with
    people's programs and that it works OK with dub, rdmd, etc. and
    doesn't have any serious architectural or other bugs. The 1.0.0
    release will not have any new functionality compared to what is
    in place now.

  * Once it seems to be reasonably stable then work can begin on a
    1.x release series that brings in successive pieces of new
    functionality.

Done :) ... if I get a response, I'll make sure to incorporate everything said.


Great, let me know how that goes. :-)
