Re: [HACKERS] [WIP] Zipfian distribution in pgbench

Fabien COELHO Sun, 13 Aug 2017 10:58:33 -0700


Hello Alik,

Now “a” does not have an upper bound, so when the iterative algorithm is used
with a >= 10000 the program gets stuck in an infinite loop because of the
following line of code:
double b = pow(2.0, s - 1.0);
After the overflow, “b” becomes “+Inf”.


Yep, overflow can happen.

So should an upper bound for “a” be set?

Yes, I agree. a >= 10000 does not make much sense... If you want uniform
you should use random(), not call random_zipfian with a = 10000. Basically
it suggests that too large values of "a" should be rejected. Not sure
where to put the limit, though.

Should I mention in the docs that two algorithms are used, depending
on the value of a (s/theta)?

Yes, as a general principle I think that the documentation should reflect
the implementation.

In the attached patch, I have added a computeIterativeZipfian method and its
usage in getZipfianRand. Would it be better to move the cache-based
computation to a new method, so that getZipfianRand contains only the 2
computeXXXZipfian method calls?

I have not looked in detail, but from what you say I would agree that the
implementation should be symmetric, so having one function calling one
method or the other sounds good.


--
Fabien.
--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
