Now you have made me worry. If the cache can only be shared by connections
created in one thread, then there is no shared cache worth the name. I must
investigate this more closely. Perhaps my reading of the documentation
included a dose of wishful thinking and a belief that "shared" meant shared!
Looking through the code shows shared cache mode introducing table locking,
and gives the impression that "shared cache mode" is really an implementation
of finer-grained locking, down to table level, with the multiple connections
just storing cursor context state.
I shall need to write some test programs to be certain, but at this stage it
does look like there is no shared cache mode as such, and that a server needs
to single-stream Sqlite access in one thread to avoid having large amounts of
data replicated in memory, at the cost of restricting concurrent read access.
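A minimal single-threaded test along these lines should settle it (the file
and table names are just placeholders): if the two connections really share
one cache, the reader should hit the table lock and get SQLITE_LOCKED while
the writer holds an open transaction; without a shared cache the writer only
holds a RESERVED file lock and the read simply succeeds.

#include <stdio.h>
#include <sqlite3.h>

int main(void)
{
    sqlite3 *writer = 0, *reader = 0;

    sqlite3_enable_shared_cache(1);          /* must be set before the opens */
    sqlite3_open("test.db", &writer);        /* placeholder file name        */
    sqlite3_open("test.db", &reader);        /* same thread, same file       */

    sqlite3_exec(writer, "CREATE TABLE IF NOT EXISTS t(x)", 0, 0, 0);
    sqlite3_exec(writer, "BEGIN; INSERT INTO t VALUES(1);", 0, 0, 0);

    /* With a genuinely shared cache this should return SQLITE_LOCKED (6);
     * without one the SELECT succeeds, since the writer only holds a
     * RESERVED file lock at this point. */
    int rc = sqlite3_exec(reader, "SELECT x FROM t", 0, 0, 0);
    printf("reader rc=%d (%s)\n", rc, sqlite3_errmsg(reader));

    sqlite3_exec(writer, "COMMIT", 0, 0, 0);
    sqlite3_close(reader);
    sqlite3_close(writer);
    return 0;
}
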
Shared cache mode would be better named "persistent cache mode" because
its main effect is to permit one thread to not flush the cache after
each transaction. The people at Mozilla report that they use it and get
better throughput on small transactions.
Thank you to those who contributed to this discussion.
Ken wrote:
John,
According to the Sqlite documentation on sqlite3_enable_shared_cache:
"There is no mechanism for sharing cache between database connections running in
different threads."
This means exactly what I said in the first place: you cannot have shared cache access across threads. I really wish that you could have multiple threads, each with a database connection using the shared cache, running concurrently.
Can you provide sample code showing the concept you are describing?
I totally understand what you are getting at with the locking. Indeed, handling locking internally in memory will always be faster (assuming RAM access is faster than disk I/O).
John Stanton <[EMAIL PROTECTED]> wrote: I think that you misunderstood the shared cache description. Cache is
shared by many connections but connections may not be passed between
threads. Each thread must maintain and use its own connection. In
our case a thread has an associated control block and the connection
handle resides there.
As long as you only access the Sqlite connection from the thread which
created it you share the cache and it works fine.
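A rough sketch of that pattern, with each worker opening and closing its own
connection (the database name and the query are just placeholders, and
whether the workers really end up on one cache is of course the question at
hand):

#include <pthread.h>
#include <sqlite3.h>

/* Each worker opens, uses and closes its own connection; the handle
 * never crosses a thread boundary. */
static void *worker(void *arg)
{
    sqlite3 *db = 0;
    sqlite3_open((const char *)arg, &db);     /* opened in this thread     */
    sqlite3_exec(db, "SELECT count(*) FROM sqlite_master", 0, 0, 0);
    sqlite3_close(db);                        /* closed by the same thread */
    return 0;
}

int main(void)
{
    pthread_t t[4];
    sqlite3_enable_shared_cache(1);           /* before any connection opens */
    for (int i = 0; i < 4; i++)
        pthread_create(&t[i], 0, worker, "test.db");
    for (int i = 0; i < 4; i++)
        pthread_join(t[i], 0);
    return 0;
}
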
The locking is a separate issue and is aimed at avoiding the dreaded
"busy wait". We use a leisurely busy wait to handle multi-process
Sqlite using file locks. The technique is not to delay after a busy is
intercepted but to force a process time-slice yield; in a server, however,
our intention is to avoid these inefficiencies by using the more efficient
synchronization features. As you would appreciate, a few percent better
efficiency on your server means a corresponding increase in the number
of possible users.
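The yield-on-busy handler amounts to something like the following sketch
(the retry cap is an arbitrary assumption):

#include <sched.h>
#include <sqlite3.h>

/* Busy handler: yield the time slice instead of sleeping, ask Sqlite to
 * retry, and give up after a cap so SQLITE_BUSY still reaches the caller
 * eventually. */
static int yield_on_busy(void *unused, int n_calls)
{
    (void)unused;
    if (n_calls > 10000) return 0;   /* stop retrying                 */
    sched_yield();                   /* surrender the time slice      */
    return 1;                        /* non-zero means retry          */
}

/* Install on a connection right after it is opened. */
void install_yield_handler(sqlite3 *db)
{
    sqlite3_busy_handler(db, yield_on_busy, 0);
}
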
Ken wrote:
John,
The server can maintain a "shared cache", but if a thread also opens the DB then that line of execution will not have a "shared cache" but rather a cache per thread. Only the server thread may open and act upon the connection utilizing a shared cache on behalf of the client. The client may not do things to the connection handle such as open, prepare, step, reset, close, or finalize.
At least that's my understanding of the "shared_cache" mode.
Using a locking primitive internally does simplify the code. But I'll contend
that if you are using multiple threads, each with its own connection to a DB
and a locking structure for internal synchronization, then you are not using
the sqlite shared cache, and you will not benefit from sqlite's locking
internals (reader/writer starvation handling). And if the workload is
write-intensive and concurrent you might as well have a single connection
that is shared across all threads.
I guess my point was that inside the server thread, once a transaction is
entered on behalf of a client then only that activity may continue and no
others. So in my design I only had two choices: re-enqueue the message inside
the server until the transactional thread completed, or return an error to the
client. I preferred keeping the message on the queue waiting to be serviced.
This is also programmatically a pain in the arse, since you must guarantee the
client doesn't abandon its responsibilities and exit without sending a close
command into the server thread, resulting in a permanently blocked server queue.
You can test this behaviour using the src/test_server.c code and some client
connections into the test_server thread.
Or I may just be totally off my rocker... and that's OK too.
Ken
John Stanton wrote: That is why the Sqlite locking is not a good fit for a threaded server.
Why not use thread locks instead and achieve the synchronization with
minimum overhead and latency? You do miss out on a couple of Sqlite
features doing that (the pending and reserved locks, which help with
concurrency and prevent write starvation), so you need to balance the
benefits of them against the detrimental effects of polling.
In our older embedded Sqlite threaded applications we just serialized
Sqlite access using a mutex because concurrency was not a prime issue,
but we use read/write locks in a higher-traffic, Sqlite-based
multi-threaded application server.
After experimentation, which included some erroneous attempts at cache
sharing, we have a strategy in place which uses the Sqlite shared cache and
assigns an rwlock to each open database. Each thread has its own DB
connection with a pointer to the locking structure for the open
database. That gives good throughput, since it holds each database open
while the server runs and maintains one cache per database, accelerating
reads. The downside is that we have to figure out a replacement for the
FTS2 accesses used for text searching.
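In outline it looks something like this (only a sketch of the arrangement,
not our server code; the type and function names are illustrative):

#include <pthread.h>
#include <sqlite3.h>

typedef struct {
    const char      *path;     /* database file                           */
    pthread_rwlock_t lock;     /* initialise with pthread_rwlock_init()
                                  when the database is first opened       */
} db_registry_entry;

typedef struct {
    sqlite3           *conn;   /* opened by this thread, used only here   */
    db_registry_entry *dbent;  /* shared lock for the database it opened  */
} thread_ctx;

/* Read path: many threads may hold the read lock at once. */
static int run_read(thread_ctx *ctx, const char *sql)
{
    int rc;
    pthread_rwlock_rdlock(&ctx->dbent->lock);
    rc = sqlite3_exec(ctx->conn, sql, 0, 0, 0);
    pthread_rwlock_unlock(&ctx->dbent->lock);
    return rc;
}

/* Write path: the write lock serializes writers and excludes readers. */
static int run_write(thread_ctx *ctx, const char *sql)
{
    int rc;
    pthread_rwlock_wrlock(&ctx->dbent->lock);
    rc = sqlite3_exec(ctx->conn, sql, 0, 0, 0);
    pthread_rwlock_unlock(&ctx->dbent->lock);
    return rc;
}
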
Since we no longer use POSIX file locking, we compile Sqlite without it
to trim some redundant overhead.
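As an aside, later Sqlite releases also register a no-op "unix-none" VFS
that skips file locking at run time, so the same saving can be had without a
custom build; a sketch, safe only when nothing else can touch the file:

#include <sqlite3.h>

/* Open without any file locking via the no-op "unix-none" VFS. */
sqlite3 *open_unlocked(const char *path)
{
    sqlite3 *db = 0;
    sqlite3_open_v2(path, &db,
                    SQLITE_OPEN_READWRITE | SQLITE_OPEN_CREATE,
                    "unix-none");
    return db;
}
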
It looks like we can replace FTS by user functions using a text indexing
method recycled from another product.
The server in question services AJAX-style WWW pages where there are
large numbers of short read transactions and minimum latency is required
to achieve a snappy response. It manages to achieve sub-millisecond
responses to database RPCs from the WWW browser.
BTW, with help from this forum we realized that our attempts to achieve
shared cache and FTS were doomed to fail for fundamental architecture
reasons and abandoned the effort. In retrospect we were trying to
implement PostgreSQL with Sqlite and that was not a rational project.
The Sqlite based application server allows a central site to support
many databases, each one specific to sets of users located globally.
Sqlite's single file databases make this very simple to administer.
Each database does not have a large number of users, relieving the
concurrency load.
For further background on using Sqlite this way look at the way Mozilla
implements it using shared cache.
Finally, it is important to recognize that Sqlite is not Oracle; it is a
well-conceived kit of tools that permits a developer to embed SQL database
capability into an application and make it fit transparently. The
developer has the source and nothing is chiselled in stone.
Ken wrote:
John,
The Sqlite API won't block; it will return an SQLITE_BUSY-type error to any
other transactions that are attempted, correct? So there is no Sqlite-level
blocking, which is a good thing when writing a server. The clients will
always block waiting on a response from the server. The server simply keeps
the client requests enqueued until it can service them some time later.
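In sketch form, that dispatch loop might look like this (the queue helpers
and the request type are hypothetical):

#include <sqlite3.h>

typedef struct request request;            /* client's SQL, reply channel, ... */
extern request *queue_pop(void);           /* hypothetical server queue helpers */
extern void     queue_push(request *);
extern const char *request_sql(request *);
extern void     reply(request *, int rc);

void service_one(sqlite3 *db)
{
    request *req = queue_pop();
    if (!req) return;

    int rc = sqlite3_exec(db, request_sql(req), 0, 0, 0);
    if (rc == SQLITE_BUSY || rc == SQLITE_LOCKED) {
        queue_push(req);                   /* keep it enqueued, service later */
    } else {
        reply(req, rc);                    /* success or a real error goes back */
    }
}
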