Re: pgbench - add pseudo-random permutation function

Fabien COELHO Thu, 14 Feb 2019 13:18:18 -0800


Hello Andres,

+# PGAC_C_BUILTIN_CLZLL


I think this has been partially superseded by

commit 711bab1e4d19b5c9967328315a542d93386b1ac5
Author: Alvaro Herrera <alvhe...@alvh.no-ip.org>
Date:   2019-02-13 16:10:06 -0300


Indeed, the patch needs a rebase & conflict resolution. I'll do it. Later.

   <para>
+    Function <literal>pr_perm</literal> implements a pseudo-random permutation.
+    It allows to mix the output of non uniform random functions so that
+    values drawn more often are not trivially correlated.
+    It permutes integers in [0, size) using a seed by applying rounds of
+    simple invertible functions, similarly to an encryption function,
+    although beware that it is not at all cryptographically secure.
+    Compared to <literal>hash</literal> functions discussed above, the function
+    ensures that a perfect permutation is applied: there are no collisions
+    nor holes in the output values.
+    Values outside the interval are interpreted modulo the size.
+    The function errors if size is not positive.
+    If no seed is provided, <literal>:default_seed</literal> is used.
+    For a given size and seed, the function is fully deterministic: if two
+    permutations on the same size must not be correlated, use distinct seeds
+    as outlined in the previous example about hash functions.
+  </para>


This doesn't really explain why we want this in pgbench.


Who is "we"?

If someone runs non-uniform tests, i.e. with random_exp/zipf/gauss, close values are drawn with a similar frequency, thus correlated, inducing an undeserved correlation at the page level (e.g. for reads) and better performance than would be the case if relative frequencies were not correlated to key values.

So the function allows more realistic non-uniform tests, whereas currently we can only have non-uniform tests with very unrealistically correlated values at the key level and possibly at the page level, meaning non-representative performance because of this induced bias.
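
To make the page-level effect concrete, here is a rough sketch in Python. It is not the patch's algorithm: a seeded shuffled lookup table stands in for pr_perm, and the names make_permutation, skewed_keys and distinct_pages, as well as the figure of 100 keys per page, are assumptions made up for this illustration.

```python
import random


def make_permutation(size, seed=0):
    """Stand-in for a seeded permutation of [0, size): a shuffled lookup table.

    Hypothetical helper for illustration only, not the function in the patch.
    """
    if size <= 0:
        raise ValueError("size must be positive")
    table = list(range(size))
    random.Random(seed).shuffle(table)  # deterministic for a given size and seed
    # As in the proposed doc text, inputs outside [0, size) are taken modulo size.
    return lambda i: table[i % size]


def skewed_keys(n, size, rng, rate=10.0):
    """Draw n keys in [0, size), small keys being much more likely
    (a truncated exponential distribution)."""
    keys = []
    while len(keys) < n:
        k = int(rng.expovariate(rate) * size)
        if k < size:  # reject the rare draws that fall past the end
            keys.append(k)
    return keys


def distinct_pages(keys, keys_per_page=100):
    """Count the pages touched, assuming keys_per_page consecutive keys per page."""
    return len({k // keys_per_page for k in keys})


if __name__ == "__main__":
    size = 100_000
    perm = make_permutation(size, seed=42)
    keys = skewed_keys(10_000, size, random.Random(1))
    print("pages touched, raw keys:     ", distinct_pages(keys))
    print("pages touched, permuted keys:", distinct_pages([perm(k) for k in keys]))
```

With raw keys the frequent values sit next to each other and share a small set of pages; once permuted, the same draws keep their relative frequencies but land on many more pages, which is the less flattering and more realistic access pattern.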

This is under the assumption that pgbench should allow more realistic performance test scenarios, which I believe is a desirable purpose. If someone disagrees with this purpose, then they would consider both the non-uniform random functions and this proposed pseudo-random permutation function useless, as probably most other additions to pgbench.


--
Fabien.
