Re: New Lagged Fib. PRNG gen and random2.d

monarch_dodra Mon, 26 Aug 2013 08:32:09 -0700

On Sunday, 25 August 2013 at 16:58:12 UTC, Joseph RushtonWakeling wrote:


Nice! :-)


Thanks ;)

I think that as a general approach it's nice, but we shouldpreferably make generic the method of wrapping a payload.

To be honest, I think that, for now, this is an "implementationdetail". All our PRNG's are created with an explicit "valuetype", that can easily be made the "Payload", no matter what wedo.

I think this is a matter of taste, but I don't think it's badto force the user to seed things. Note that the existing RNGsgo to the extent of actually checking inside popFront() etc. ifthe RNG has been initialized, and if not, seeding it with adefault value.
The only thing I'll remark is that these default-seed ideas arequite prevalent in other RNG implementations in otherlanguages. Some of the existing unittests may even be based onthat default seeding. So, there may be some expectation ofthat option being available.

An "option" I had dabbled in was making the PRNG proper a"Container", which you can *slice* to obtain a high performancerange. EG:


//----
auto prng = Random(); //Creates a PRNG.
auto first = prng.front; //lazilly seeds
prng.popFront(); //Checks seeded
auto second = prng.front; //Cheks seeded again...

auto fastPRNG = prng[]; //Lazilly seeds, then returns a type*guaranteed* seeded.

auto third = fastPRNG.front; //No check of isSeeded.
//----

The initial reason I played around with this, is that it was anoption to "solve" the reference problem: Make the PRNG'sthemselves value types, and iterate using reference type*slices*. The end result (for "std.random1") has a couple *major*drawbacks though, IMO:1. Opt-*in*: Only those aware will use it. The rest of us mortalswill still get hit by the bugs.

2. Easily escapes references to locals.

3. As always, if we introduce std.random2, it'll be that muchmore to clean-up.

Still, I thought I'd share my ideas, even if they aren't the bestsolutions. It *is* an option for the (don't)auto-seed problem.

One more thing. One of the ways I see it (specifically for LF):We can add auto-seed later if we decide it was a bad idea to nothave it. The opposite isn't possible.

I think the two of us agree on the general principle that RNGsshould be re-done as structs that wrap a reference to apayload. Performance- and design-wise, I'd say that thefollowing things are worth considering:
(1) Should the memory management be done via GC (new) as inyourimplementation, or manually managed? Bear in mind thegeneraldesirable principle that Phobos code shouldn't rely onthe GC if it
      doesn't have to.
I don't know what impact the use of the GC here can beexpected to haveon overall performance of a piece of code, and whether itmight rule outuse of this design in highly performance-conscioususe-cases such as
      games, etc.
My own use of RefCounted to ensure manual memorymanagement (no GC)
      seems to have some scope-related issues, see:

http://forum.dlang.org/post/[email protected]
... and I assume that if payload was allocated manuallyvia malloc
      instead of new, then the same might be true.

Another option I had thought of, was to make the value-typesPayloads as public, with *massive* "do not use signs". *We* thenprovide simple and generic (and pure/nothrow) GC-based wrappers.

From there, if, for whatever reason, a user needs mallocallocation, or heck, static allocation, he still has an"underhand" access to the payload implementation, but lifecyclemanagement remains *his* responsibility.

I think: Simple and safe by default, with the possibility forextension.

(2) Should we provide a generic payload wrapper, or requireRNG creatorsto implement everything manually? I strongly support ageneric wrapper,as there is too great a risk of implementation error ifwe require
      the wrapping-of-a-reference to be done manually each time.
(3) As we discussed off-list, if we do implement a genericwrapper, how
      should it work?  Should it work as per my implementation,

          alias RandomGenerator!MyEngine MyRNG;

      or instead as you suggested, as a template mixin,

          struct MyRNG
          {
              mixin RandomGenerator!MyEngine;
          }
I can see the case for the latter as it will result inmuch more readabletype names in error messages. However, I think it hasthe potential toobscure implementation details in a way that isn'thelpful. Consider what
      happens if we do:

          alias RandomGenerator!MtEngine19937 Mt19937_64
// ... WRONG!! we forgot to tweak the engine when wecopy-pasted,// but at least we'll see in any error messages whattype of internal
          // engine is being used

      ... compared to:

          struct Mt19937
          {
              mixin RandomGenerator!MtEngine19937;
          }
// ... Copy-paste failure again, but this time it'sobscured and we'll// never find out unless we look at the actual sourcecode.

Again, for now, I think this is implementation detail. That said,I don't buy much into the typo argument. In particular, this issomething that gets copy pasted only once, so we should berelatively safe.

One of the arguments in favor of "internal" mixins is that ofextension: For example, laggedFib has a very real reason toimplement popFrontN, as it can potentially pop thousands ofelements at once in o(1). "Internal" mixin makes it easy toextend a PRNG's capabilities past that of the "lowest commondenominator". With a "alias Random =RandomGenerator!RandomPayload", you are really limited towhatever RandomGenerator wrapped.

(4) The devil's advocate position -- should we take thesimple route toreference-type RNGs by making them final classes? It'strivial to do butto me it feels "un-Phobos-ish" and will also have theproblem of requiringa lot more code rewrites on the part of std.random userswho want to
      upgrade to std.random2.

That seems like a good opening summary of issues -- destroy. :-)

Best wishes,

    -- Joe

Honestly, it might just be the simplest thing to do. For one, itwould elegantly solve the "must be seeded" issue (allocation isinitialization). It *guarantees* Reference semantics. Finally,the (supposed) overhead should be inexistant compared to thecompelxity of a PRNG.

The option of allowing "public Payload" definitions could stillleave an open door for those that need PRNG's, but don't use theGC (think vidjagames).

Re: New Lagged Fib. PRNG gen and random2.d

Reply via email to