Re: [elephant-devel] Understanding real-world use of Elephant

Ian Eslick Thu, 24 May 2007 18:18:42 -0700

As a user of Elephant, you really shouldn't have to worry too muchabout threading so long as you follow the simple rules laid out inthe manual under "multi-threading". I think you are trying tounderstand how we make this possible since it seems harder from yourread of the acache interface.

A simple conceptual model is that each thread has its owntransaction. If these transactions are executing concurrently, theBDB library or SQL server logs the side read dependencies and theside effecting writes until a commit or abort. On abort, you throwaway the log. On a commit, one transaction at a time writes theirside effects and cancels any transaction that has read or written oneof the entries written by the committing transaction.

Thread isolation is guaranteed by a policy for using the BDB libraryor SQL server. Calls to these libraries are re-entrant. The currenttransaction ID (only used by one thread) determines where the readsand writes are logged (this is a little more complex for SQL, buthandled by the data store and transparent to the user).

The only other thing we need to do is make sure that the parts ofelephant shared across threads are themselves protected by Lisplocks. Most of this is the store-controller and some data structuresused to initialize a new store controller.

If you stick to the user contract in the documentation, you shouldn'thave to worry further about interactions of multiple threads (otherthan the usual considerations if you have shared lisp variablesinstead of shared persistent objects).

Please let me know what would need to be clarified in thedocumentation to make this clearer, or ask pointed questions if theabove was not a sufficient explanation. After all, the user isalways right!


Thank you,
Ian


On May 24, 2007, at 7:46 PM, Robert L. Read wrote:

I would have to disagree about the documentation for Elephant notbeing abundant---Ian has written a 118 page manual.
Nonetheless, you are correct that the use of Elephant in a multi-threaded webserver environment is not heavily documented. Ian andI have discussed the need for a killer "example app" and eagerlyawait someone contributing one.
I'm a little rusty on some of this, but if you will agree to takemy opinion about skeptically, I will describe my thoughts on thesubject. My own application, Konsenti, is a multi-threaded webapplication.
In the first place, one has to go back to basics a little bit.Whenever you have concurrency, you have to have concurrencycontrol. Personally, I think to think of this at the object level,but I know it is now common to think of it at the "databaselevel". You are generally correct that if you are using SQL (orBDB, for that matter) as a database and you keep ALL of your statein the database, then you can generally rely on the concurrencycontrol of those systems to serialize everything and not allow twothreads to interfere with each other. However, almost ANYapplication will have to think about concurrency; if your are SQL-orieneted, you will do this by defining "transactions". If youdefine your transactions wrong, you will have concurrency errors,even though the database serializes transactions perfectly.
For example, generally, since the Web forces us into "page based"or "form based" interaction (which javascript, CSS and Ajaxpromptly complicate), one can generally think of a web applicationsas "one-page turn == one transaction". But even that might not betrue---you could take 10 page turns to submit an order, and theorder must be atomic---that is, you either order the thing with 10pages of specifications, or you don't order it. A half-order is acorrupt order.
Elephant has the "with-tranaction" macro. This is really the bestway to use Elephant in most circumstances --- but even then, if youare keeping ANYTHING in memory, whether to cache it for speed orbecause you don't want to put it in the database (session-basedinformation would be a typical example), you may still have to"serialize" access to that state. That unavoidable means that youhave to understand some sort of mutual exclusion lock;unfortunately, these are not 100% standard across different lisps.However, most lisps do provide a basic "with-mutex" facility. Iuse this in the DCM code (in the contrib directory) to serializeaccess to a "director", which is very similar to a version of Ian'spersistent classes, but based on a "keep it all in memory and pushwrites to the DB" model (that is, Prevalence.)
So before we can really answer you question deeply, I think wewould have to understand more about what you are doing.
If you will forgive me over-simplifying things, if:
1) You can legitimately think of every page turn as a transaction, and
2) You keep all of the meaningful state in the Elephant DB, and
3) You wrap your basic "process-a-page" function in "with-transaction",
then you won't have a concurrency control problem.
That is a completely appropriate style for certain relativelysimple web-applications; for other kinds of web-applications itwould be very constraining and slow --- but one should neveroptimize for speed before it is necessary.
I don't know if that's a useful answer --- if you can refine yourquestion, I will try again.
On Thu, 2007-05-24 at 18:08 -0400, [EMAIL PROTECTED] wrote:
Hi all, I'm still on my quest to learn to effectively useElephant. Although documentation is not so abundant, I've gotten apretty good start with the available documentation. However, Idon't have such a strong background on using ODBs and mainly comefrom the SQL world. So, just for curiousity's sake, I read thetutorial for AllegroCache which tries to show the "proper" way touse AllegroCache in real-world systems (http://www.franz.com/products/allegrocache/docs/acachetutorial.pdf). Well, conceptwise,I followed everything in that tutorial and could pretty muchrelate everything to Elephant. However, the truth is that Ithought there was a tremendous "overhead" in using AllegroCachewhen dealing with multiple threads. The reality is that theapplications we are thinking on using Elephant for are web-basedapplications served by multiple web servers, so the practicalityof the tutorial was more inline to our objective use of Elephant.So, I figured that both systems (Elephant and AllegroCache) havemore or less the same usage for practical purposes. But, none ofthe Elephant documentation I've read on Elephant has even referredto the things mentioned in the tutorial when dealing with multiplethreads. I have read on the list that Elephant is thread safe, butam wondering if anyone could help me understand how Elephant wouldbe different from AllegroCache. For example, do I still need tohandle connection pools and thread locks and all of that in orderto do simple multi-threaded requests? Not that I'm looking for aneasy way out, but coming from the world of simply "using" a SQL-database and being presented with the needs of having toincorporate a lot of "logic" that the database server would'vehandled for you into your application seems a bit overwhelming atfirst. I don't know if it has anything to do with the wayAllegroCache is architected (e.g. things like isolation betweenconnections, etc) or other factors I'm not aware of. Again, I onlyread the tutorial in trying to learn more how to "apply theconcepts" of ODB so I could understand them better. Any commentswould be greatly appreciated. Thanks, Daniel_______________________________________________ elephant-develsite list elephant-devel@common-lisp.net http://common-lisp.net/mailman/listinfo/elephant-devel
_______________________________________________
elephant-devel site list
elephant-devel@common-lisp.net
http://common-lisp.net/mailman/listinfo/elephant-devel


_______________________________________________
elephant-devel site list
elephant-devel@common-lisp.net
http://common-lisp.net/mailman/listinfo/elephant-devel

Re: [elephant-devel] Understanding real-world use of Elephant

Reply via email to