Re: [HACKERS] extend pgbench expressions with functions

Fabien COELHO Wed, 04 Nov 2015 01:07:51 -0800


Hello Robert,

1. I think there should really be two patches here, the first adding
functions, and the second adding doubles.  Those seem like separate
changes.  And offhand, the double stuff looks a lot less useful that
the function call syntax.

I first submitted the infrastructure part, but I was asked to show howmore functions could be included, especially random variants. As randomgaussian/exponential functions require a double argument, there must besome support for double values.

Now, it could certainly be split in two patches, but this is ratherartificial, IMO.

2. ddebug and idebug seem like a lousy idea to me.


It was really useful to me for debugging and testing.

If we really need to be able to print stuff from pgbench, which I kindof doubt, then maybe we should have a string data type and a print()function, or maybe a string data type and a toplevel \echo command.

The *debug functions allow to intercept the value computed within anexpression. If you rely on variables and some echo (which does not exist)this means that there should be double variables as well, which is notcurrently the case, and which I do not see as useful for the kind ofscript written for pgbench. Adding the string type is more work, and

I do not see a good use case for those.

So the *debug functions are really just a lightweight solution fordebugging type related issues in expressions. I can drop them if this is ablocker, but the are really useful for testing quickly a script.

3. I'm perplexed by why you've replaced evaluateExpr() with evalInt()
and evalDouble().

As explained above in the thread (I think), the reason is that having oneoverloaded expression evaluation which handles types conversion wouldproduce pretty heavy code, and the two functions with the descendingtyping allows to have a much smaller code with the same effect.

The issue is that with two types all functions must handle argument typeconversion explicitely.

For instance for "random_gaussian(int, int, double)", it may be calledwith any combination of 3 int/double arguments, each one must be testedand possibly converted to the target type before calling the actualfunction. For overloaded operators or functions (arithmetics, abs...)there is also the decision about which operator is called and then whatconversions are necessary.

With the descending typing and two functions cross recursion all theseexplicit tests and conversion disappear because the function evaluationcalls evalInt or evalDouble depending on the expected types.


Basically, the code is significantly shorter and elegant with this option.

That doesn't seem similar to what I've seen in otherexpression-evaluation engines.

Probably. This is because I choose a descending typing to simplify theimplementation. Changing this would bring no real practical benefit fromthe usage point of view, but would add significant more verbose and uglycode to test and handle type conversions everywhere, so I'm not keen to dothat.

Perhaps I could find out by reading the comments, but actually not,because this entire patch seems to add only one comment:
+       /* reset column count for this scan */


There are a few others, really:-)

While I'm not a fan of excessive commenting, I think a little more
explanation here would be good.

I can certainly add more comments to the code, especially around the evalcross recursion functions.

4. The table of functions in pgbench.sgml seems to leave something to
be desired.  We added a pretty detailed write-up on the Gaussian and
exponential options to \setrandom, but exporand() has only this
description:

Yep. The idea was *not* to replicate the (painful) explanations aboutrandom functions, but that it should be shared between the function andthe \set variants.

+      <row>
+       <entry><literal><function>exporand(<replaceable>i</>,
<replaceable>j</>, <replaceable>t</>)</></></>
+       <entry>integer</>
+       <entry>exponentially distributed random integer in the bounds,
see below</>
+       <entry><literal>exporand(1, 10, 3.0)</></>
+       <entry>int between <literal>1</> and <literal>10</></>
+      </row>

That's not very helpful.


The table explanation must be kept short for the table format...

Without looking at the example, there's no way to guess what i and jmean, and even with looking at the example, there's no way to guess whatt means. If, as I'm guessing, exporand() and guassrand() behave like\setrandom with the exponential and/or Gaussian options, then thedocumentation for one of those things should contain all of the detailedinformation and the documentation for the other should refer to it.


Indeed, that was the idea, but it seems that I forgot the pointer:-)

More than likely, exporand() and gaussrand() should get the detailedexplanation, and \setrandom should be document as a deprecatedalternative to \set ... {gauss,expo,}rand(...)

Ok, the description can be moved to the function part and the \setversion reference the other.

5. I liked Heikki's proposed function names random_gaussian(min, max,
threshold) and random_exponential(min, max, threshold) better than the
ones you've picked here.  I think random() would be OK instead of his
suggestion of random_uniform(), though.

Ok.

I'll submit an updated version of the patch, which addresses points 4 & 5and documents 3, and wait for feedback on the explanations I gave beforedoing anything for about 1 & 2, as I think that the implied changes arenot desirable. I'm not keen at all on changing the cross recursionimplementation (3), this would just be pretty ugly code without actualuser benefit.


--
Fabien.


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] extend pgbench expressions with functions

Reply via email to