Is it possible that the first call to random is cached and the cached value is being returned in subsequent calls?

-------------------------
Scott Doctor
sc...@scottdoctor.com
-------------------------

On 12/8/2017 12:09, John McKown wrote:
On Fri, Dec 8, 2017 at 12:54 PM, John Mount <jmo...@win-vector.com> wrote:

I am seeing an issue where a random value in a sub-query gets re-generated
each time the value is used in later queries.  Below is an example query:

SELECT r AS r1, r AS r2 FROM ( SELECT random() AS r FROM ( SELECT * from (
VALUES(1),(2) ) ) a ) b

One would expect r1 == r2.

---------------
John Mount

​Interesting. I duplicated your example using PostgreSQL and it does what
you said you expected: R1==R2. In SQLite3, the .eqp full shows:

sqlite> .eqp full
sqlite> SELECT r AS r1, r AS r2 FROM ( SELECT random() AS r FROM ( SELECT *
from ( VALUES(1),(2) ) ) a ) b;
--EQP-- 0,0,0,SCAN SUBQUERY 1
addr  opcode         p1    p2    p3    p4             p5  comment
----  -------------  ----  ----  ----  -------------  --  -------------
0     Init           0     0     0                    00  Start at 0
1     InitCoroutine  1     7     2                    00
sqlite_sq_5619D81F9BF8
2     Integer        1     2     0                    00  r[2]=1
3     Yield          1     0     0                    00
4     Integer        2     2     0                    00  r[2]=2
5     Yield          1     0     0                    00
6     EndCoroutine   1     0     0                    00
7     InitCoroutine  1     0     2                    00
8       Yield          1     13    0                    00  next row of
"sqlite_sq_5619D81F9BF8"
9       Function0      0     0     4     random(0)      00  r[4]=func(r[0])
10      Function0      0     0     5     random(0)      00  r[5]=func(r[0])
11      ResultRow      4     2     0                    00  output=r[4..5]
12    Goto           0     8     0                    00
13    Halt           0     0     0                    00
4548137244590923354  8821858240296964415
761559492082550893  2723588653195689097
​
I think this is being done due to the "subquery flattening" as described on
http://sqlite.org/optoverview.html, which says:

To overcome this problem, SQLite attempts to flatten subqueries in the FROM
clause of a SELECT. This involves inserting the FROM clause of the subquery
into the FROM clause of the outer query and rewriting expressions in the
outer query that refer to the result set of the subquery. For example:

So your "SELECT r AS r1, r AS r2 FROM ( SELECT random() AS r FROM ( SELECT
* from ( VALUES(1),(2) ) ) a ) b;" ​be​comes

"SELECT random() AS r1, random() AS r2 FROM ( SELECT * from (values(1),(2))
a) b;"

I think the above is born out if you put the word DISTINCT in front of the
random() in the original example. This forces SQLite to _not_ use subquery
flattening. Example:

sqlite> SELECT  r AS r1, r AS r2 FROM ( SELECT distinct random() AS r FROM
( SELECT * from ( VALUES(1),(2) ) c ) a ) b;
--EQP-- 1,0,0,SCAN SUBQUERY 2 AS c
--EQP-- 1,0,0,USE TEMP B-TREE FOR DISTINCT
--EQP-- 0,0,0,SCAN SUBQUERY 1 AS b
addr  opcode         p1    p2    p3    p4             p5  comment
----  -------------  ----  ----  ----  -------------  --  -------------
0     Init           0     0     0                    00  Start at 0
1     InitCoroutine  1     18    2                    00
sqlite_sq_55E270A58EA8
2     InitCoroutine  2     8     3                    00
sqlite_sq_55E270A58AA8
3     Integer        1     3     0                    00  r[3]=1
4     Yield          2     0     0                    00
5     Integer        2     3     0                    00  r[3]=2
6     Yield          2     0     0                    00
7     EndCoroutine   2     0     0                    00
8     OpenEphemeral  3     0     0     k(1,B)         08  nColumn=0
9     InitCoroutine  2     0     3                    00
10      Yield          2     17    0                    00  next row of
"sqlite_sq_55E270A58AA8"
11      Function0      0     0     5     random(0)      00  r[5]=func(r[0])
12      Found          3     16    5     1              00  key=r[5]
13      MakeRecord     5     1     6                    00  r[6]=mkrec(r[5])
14      IdxInsert      3     6     0                    00  key=r[6]
15      Yield          1     0     0                    00
16    Goto           0     10    0                    00
17    EndCoroutine   1     0     0                    00
18    InitCoroutine  1     0     2                    00
19      Yield          1     24    0                    00  next row of
"sqlite_sq_55E270A58EA8"
20      Copy           5     7     0                    00  r[7]=r[5];
sqlite_sq_55E270A58EA8.r
21      Copy           7     8     0                    00  r[8]=r[7]
22      ResultRow      7     2     0                    00  output=r[7..8]
23    Goto           0     19    0                    00
24    Halt           0     0     0                    00
920225462863128947|920225462863128947
-723158119245037038|-723158119245037038

​Hope this at least explains what is happening. I don't know if this is
"correct" or not.​



_______________________________________________
sqlite-users mailing list
sqlite-users@mailinglists.sqlite.org
http://mailinglists.sqlite.org/cgi-bin/mailman/listinfo/sqlite-users

Reply via email to