Re: [sqlite] Re: SQLite and nested transactions

Darren Duncan Thu, 12 Apr 2007 16:29:07 -0700

At 9:48 AM -0600 4/12/07, Dennis Cote wrote:

Yes I did assume no coupling because you didn't suggest any. Ifthere is coupling this is just another case of the second example.

While I didn't explicitly suggest coupling before, I was making myarguments on the general case where actions against a database maypossibly be coupled, and my argument was towards solutions that workfor the general case. Sorry if I didn't communicate before that Iwas speaking to the general case.

So put all the sub-steps in a subroutine and call it.

In the general case, I don't control what the sub-steps are, but I ambeing a proxy for someone else, and I don't know in advance what theywould ask for. Also, as users may want data returned to them betweenthe sub-steps, their use for which could include determining whatsub-steps are, I can't just generate a subroutine at runtime toexecute, as then they wouldn't get anything back from theirintermediate queries on time. That said, I recognize that in somesituations it is possible for the stored procedure to embed all thedecision making logic necessary from the application, but this isn'talways true, as eg some user may be involved in intermediate steps.

I think that a SQLite pager-based mechanism for tracking childtransactions is quite a bit less complicated and more reliable thanusing your workaround, since no details have to be remembered butfor the pages that changed.
That is not true and you know it.


On the contrary, I believe what I said.

You are just pushing the complexity back to Richard. He will have toimplement the changes to the parser, code generation, pager layers,and test suite, as well as address the backwards compatibilityissues.

I don't see this as a problem. While it is true that a lot ofcomplexity can be layered on top of the DBMS rather than beinginternal to the DBMS, I see child transactions as something that isbest implemented inside the DBMS.

Speaking in a very loose analogy, I see the complexity as SQLite isnow compared to with child transaction support to be like replacing:


  foo();
  foo();

With:

  for ... {
    foo();
  }

That is, I see it as the difference between explicitly doingsomething twice, and doing it once but inside a loop.

So as one can refactor code to use loops rather than explicitrepeating, I don't see the end result here being much larger or moredifficult to maintain. That is, we aren't just adding code, but alsotaking away some that has become redundant, is how I conceptualize it.

So SQLite with child transactions is only trivially less lite thanit is now, which is still lite.
If it is a trivial as you suggest, then you should have alreadyprepared a patch. :-)

I wasn't saying that the patch itself was trivial (though I'm sayingit should be a simpler than the patches for many other requestedfeatures), but rather that the measures of how "lite" SQLite is wouldchange a trivially small amount between before and after.

In fact, I propose moving rollbackable child transaction support tothe top of the todo list, rather than it being in the middle, giventhat its presence can make a lot of other todo or wishlist itemsmuch easier to implement, I believe.
And if it will make a difference, I will even make a monetarydonation (as I can afford to) in order to sponsor its development(though I would like to think that the benefits are compelling ontheir own).
You will have to discuss this with Richard Hipp.


Yes, of course.  And I already did do that a few minutes after the list post.

How will nested transactions make creating a your wrapper easier?Please be specific.

Well, to help people better understand this, I should start butoutlining my own connected work.

I am writing a free and open source RDBMS of my own, whose maininnovations relative to the general DBMS field are in the queryengine, namely the public face (programmatic API and query language)that application developers and their users interact with. My RDBMShas its own query language and feature set which overlaps with butisn't the same as that of existing SQL DBMSs.

My RDBMS is structured as a framework with separate public interfaceand backend implementation layers (called "Interface" and "Engine),such that the backend is a swappable plugin-style component. The"interface" or wrappers thereof handle parsing user queries into anstandardized AST format, which is what an "Engine" takes as input andthe engine implements the AST-defined query however it wants. Thenative language and AST of my RDBMS define rigorous semantics whichusers should be able to expect, and which an Engine is supposed tocomply with.

Note that a single query in my language is a full-blown routinedefinition (which in the trivial case just contains a singlestatement), so what it does and what format of data it can processfor input or output is arbitrarily complex. All routine argumentsare named, and they serve a purpose analagous to SQL's "hostparameters" (aka, "bind variables"), which can exchange input andoutput with the application. A query-routine is separatelyprepared/compiled and executed by the application, the latterrepeatable as expected.

An Engine can either be self-contained, natively implementingeverything the AST specifies by itself, or it can implement a gluelayer to some other DBMS, translating the AST into queries native tothe other DBMS. The features of the underlying DBMS will beexploited as much as possible, so we get the best resourceefficiency, and only if the underlying DBMS can't do somethingnatively, does the Engine emulate the missing features (often a morecomplicated affair) over top of something else that the underlyingdoes support, in so much as is possible, so the user still gets thefeatures.

In the general case, my own focus is on the Interface, which is theframework core, and each Engine is made and distributed separately bya third party, citing the core as its main external dependency.However, there is one "Example" Engine that I bundle with theinterface so that it is possible to thoroughly test the core inisolation from external dependencies.

The framework core will also have a "Validator" test suite, akin toSun's certification suite for Java implementations, that uses thewhole RDBMS API against a user-specified Engine to check that itbehaves to spec. Any third party Engine can generally just invokethis test suite in 'make test' rather than having to write up its owntest suite. Validator itself is tested using Example.

The core-bundled Example Engine is naively implemented and focuses ondelivering the correct semantics for all language features, butdoesn't worry about being scalable; its main purpose is to serve as aproof of concept to study from, but that it can actually be used as atest platform when building applications over my RDBMS.

But I will also develop or co-develop several other Engines (not justdepending on third parties for said), to be distributed separately,which are intended more for handling larger workloads in a typicalproduction environment; they are non-naive, at a cost of being morecomplex.

Generally speaking, I leave the development of storage layers, and/ormodules that are more concerned with the low level details, toothers, as they are a lot more innovative, expert, and interested inthis area than I am, while I focus on the user experience side ofthings.


So this is where SQLite comes in.

One of the, probably *the*, non-Example Engines will do its job usingSQLite, and it will try to do so in the most optimal way possible,using all the native and high-level features that SQLite offers totheir full advantage, as is possible. In large part, it willgenerate SQLite-flavor SQL from my language's AST, have SQLiteexecute it, and package any bind variables or output. If a singleSQLite query

It stands to reason, then, that my job in implementing my RDBMS overSQLite will be easier if SQLite natively supports the fundamental /conceptually low-level features that my RDBMS does, so mapping willbe fairly straight forward.

For example, since SQLite 3 supports prepared statements, and namedbind variables, a prepare() against my DBMS turns into agenerate-and-prepare-SQL against SQLite, a variable-bind against myDBMS turns into one against SQLite. An execute() against my DBMSturns into one against SQLite. Assuming SQLite supports multiplesimultaneous prepared statements, then any multitude of SQLstatements that my routine may turn into (when I can't just do thepreferred action of using a single SQL statement defining a SQLstored procedure or nested query), will be a multitude of SQLiteprepared statements, which are then simply executed in sequence byexecute(). This is more efficient than instead having to bothprepare+execute SQL against SQLite in my execute() were it not tohave its own prepared statement support.


Now, about child transactions.

Part of my own RDBMS' feature set is that users can define their ownarbitrarily complex data types, operators/routines, and databaseconstraints (common examples being unique key and foreign keyconstraints), which then can live in the database and just beavailable to its users like built-ins. But besides that, my RDBMShas a native set of types and operators that are different from thosein SQL. It also supports multiple-assignment statements (eg, makechanges to multiple tables in a single statement), and updateableviews; insofar as possible, views are treated the same as basetables. It also supports performing data-definition by performingdata-manipulation against the information schema (in fact, typicalDDL is short-hand for doing that).

All database constraints in my RDBMS are immediate, and are appliedbetween statement boundaries at all levels, so no statement will eversee a version of the database that is inconsistent / violates anyconstraints. And so then what, you may ask do we do if we have aconstraint saying one table must be credited while another debited,and the constraint shouldn't apply between the first and second step;well to that I say, is what single multi-update statements are for;the 2 updates are conceptually happening "at the same time".

My RDBMS is ACID in the strictest sense. It uses implicittransactions everywhere. Every operator/routine is implicitly atomicand hence a transaction. Every statement at any level of the callstack is atomic. Any explicit transaction within most types ofroutines takes the form of a try-style code block. So generally, thenumber of transaction layers is equal to the depth of the call stack.There are no standalone "start/commit/rollback" statements except inthe parent-most anonymous routine in the callstack that theapplication directly invokes; all "stored" routines use the block orimplicit form only, so we ensure no dangling transactions. Anystatement failure, which can be due to the statement violating adatabase constraint, will throw an exception, which will rewind theroutine call stack (and transaction stack) one at a time until someroutine or block catches it. If a statement/routine completes/exitsnormally, its implicit transaction commits; it implicitly rolls backif an exception causes it to exit early. When an exception iscaught, the transaction layers of the catching statement and itsparents have not rolled back and can still be committed, and newchild transactions can still be started, such as a "try againdifferently" on the failure.

Suffice it to say that it will be a lot easier for me if theimplementation of each operator and routine et al in my languageagainst SQLite can simply issue "start/commit/rollback" transactionat its start/end as is appropriate, which would happen nicely ifSQLite has native child transaction support.

If SQLite doesn't support child transactions, I would have to add thecomplexity of remembering everything that was done by the parent-mostroutine and keeping track of level counts and what-have you, which isa real pain.

Oh, and in case you say that I'm already managing it myself withExample, then I would say you are right, however in that case,Example's analogy to SQLite's pager layer is entirely built-in to it,and so I am implementing the feature right where it should be, there,and Example only has to track what pages changed, not a list ofexecuted statements, which is a lot simpler.

So what I'm proposing for SQLite is no less than what I would expectto do myself.

How much more complicated is the nested transaction solution if*you* have to implement it?

If you mean, in my own SQLite-using program, hopefully this has nowbeen explained between my various posts.

If you mean, my implementing it in SQLite itself, that is highlyimpractical since I'm very poor at C and the regular maintainers ofSQLite would be able to do the job many orders of magnitude fasterthan I could.

My contribution to the development of SQLite is mainly on the side ofdesign suggestions (which are generally programming languageagnostic) and what things are helpful from the users' perspective.


-- Darren Duncan

-----------------------------------------------------------------------------
To unsubscribe, send email to [EMAIL PROTECTED]
-----------------------------------------------------------------------------

Re: [sqlite] Re: SQLite and nested transactions

Reply via email to