Re: [sqlite] Next Version of SQLite

John Stanton Tue, 15 Jan 2008 15:55:10 -0800

[EMAIL PROTECTED] wrote:

Joe Wilson <[EMAIL PROTECTED]> wrote:

--- "D. Richard Hipp" <[EMAIL PROTECTED]> wrote:
Sorry for the confusion.
No problem.
For what it's worth, I am also curious as to the final form of theVM opcode transformation. The number of opcodes generated by the variousSQL statements seems to be roughly the same as the old scheme. At thispoint without sub-expresssion elimination are you seeing any speedimprovement?


I have not even looked at performance yet.  I'm assuming that
performance will drop during the conversion process and that we
will have to fight to get it back up to previous levels after
the conversion is complete.

But consider would can be done with a register machine that
would couldn't do with the old stack machine.  In a statement
like this:

    SELECT * FROM a NATURAL JOIN b;

Suppose tables a and b have column c in common and unique

columns a1, a2, a3 and b1, b2, b3. With the stack machine,the algorithm is roughly this:


    foreach each entry in a:
      foreach entry in b with b.c==a.c:
        push a.c
        push a.a1
        push a.a2
        push a.a3
        push b.b1
        push b.b2
        push b.b3
        return one row of result
      endforeach
    endforeach

For each result row, all columns had to be pushed onto the
stack.  Then the OP_Callback opcode would fire, causing
sqlite3_step() to return SQLITE_ROW.  The result columns would
then be available to sqlite3_column_xxx() routines which read
those results off of the stack.  When sqlite3_step() is called

again, all result columns are popped from the stack andexecution continues with the first operation after theOP_Callback.


In the register VM, result columns are stored in a consecutive
sequence of registers.  It is no longer necessary to pop the
stack of prior results at the start of each sqlite3_step().
So the code can look more like this:

    foreach entry in a:
      r1 = a.c
      r2 = a.a1
      r3 = a.a2
      r4 = a.a3
      foreach entry in b where c=r1:
        r5 = b.b1
        r6 = b.b2
        r7 = b.b3
        return one row of result
      endforeach
    endforeach

When result are stored in registers, the computation of thefirst four columns of the result set can be factored out ofthe inner loop. If there are 10 matching rows in b for every

row in a, this might result in a significant performance boost.

Do not look for this improvement right away, though.  The
first order of business is to get the VM converted over into
a register machine.  Only after that is successfully accomplished
will we look into implementing optimizations such as the above.

--
D. Richard Hipp <[EMAIL PROTECTED]>

This architecture replacement is a very significant step for Sqlite frommy perspective. It indicates a transition from the simple, more easilyimplemented technology into a more refined one providing the basis foradding more sophistication in optimization and a generally moreefficient core.


Such ongoing evolution guarantees that Sqlite will not ossify and become
irrelevant.  Congratulation Dr Hipp and team for not leaving well alone.


-----------------------------------------------------------------------------
To unsubscribe, send email to [EMAIL PROTECTED]
-----------------------------------------------------------------------------



-----------------------------------------------------------------------------
To unsubscribe, send email to [EMAIL PROTECTED]
-----------------------------------------------------------------------------

Re: [sqlite] Next Version of SQLite

Reply via email to