On May 25, 2007, at 4:32 PM, [EMAIL PROTECTED] wrote:
Hello Ian, Robert, and Henrik
I'll try to comment based on the responses received from the three
of you in this single thread so as to minimize the posts. Before
proceeding, let me just clarify that I am only interested in using
the BDB backend.
I would have to disagree about the documentation for Elephant not
being
abundant---Ian has written a 118 page manual.
Nonetheless, you are correct that the use of Elephant in a multi-
threaded webserver environment is not heavily documented. Ian and I
have discussed the need for a killer "example app" and eagerly await
someone contributing one.
First of all, I want to apologize if my comment came across the
wrong way. I know that Ian (and whoever else has been contributing)
has done a superb job at enhancing Elephant's documentation. It
definitely has come a long way. I first had a bit of difficulty
finding the latest documentation since I couldn't find it online. I
then learned that it came in the doc directory and that you had to
"make" it. Anyway, it is great!
Documentation has always been available online, so only developers
updating the web site or editing the documentation will need to
'make' it. The new site has documentation at:
http://common-lisp.net/project/elephant/documentation.html. This page
can be reached by clicking the "documentation" link in the leftmost
column of the home page. You can jump directly to the latest online
texinfo-style HTML by clicking 'Online Docs' in the upper right hand
corner of the home page. Do you know what caused you to miss these
links to the documentation? Was there anything confusing about our
site that we could fix?
As far as multi-threaded webserver environment, I know there was a
section about it in the doc (section 6.5) but, as you said, it's
not very elaborate.
Read 4.10, 4.11 and 4.13. Section 6.5 needs more work to serve as a
proper example; section 6 mostly has placeholders at present. I'll
see about expanding on 4.10 and 6.5 as time allows.
However, I don't have such a strong background in using ODBs and
mainly come from the SQL world. So, just for curiosity's sake, I read
the tutorial for AllegroCache which tries to show the "proper" way to
use AllegroCache in real-world systems
(http://www.franz.com/products/allegrocache/docs/acachetutorial.pdf).
I'd like to clarify my comment above. Because I read several
AllegroCache documents, I misreferenced the document I really
wanted to reference.
The document in question is titled "AllegroCache with Web Servers"
and can be found here:
http://www.franz.com/products/allegrocache/docs/acweb.pdf
As you comment below, reading the acache document created a great
deal of confusion! Please ignore it. While the object and class
interfaces are similar, the system implications and usage models can
be very different so as you comment, you are comparing apples and
oranges.
In the first place, one has to go back to basics a little bit.
Whenever you have concurrency, you have to have concurrency control.
Personally, I tend to think of this at the object level, but I know
it is now common to think of it at the "database level". You are
generally correct that if you are using SQL (or BDB, for that matter)
as a database and you keep ALL of your state in the database, then
you can generally rely on the concurrency control of those systems to
serialize everything and not allow two threads to interfere with each
other. However, almost ANY application will have to think about
concurrency; if you are SQL-oriented, you will do this by defining
"transactions". If you define your transactions wrong, you will have
concurrency errors, even though the database serializes transactions
perfectly.
For example, since the Web forces us into "page based" or "form
based" interaction (which javascript, CSS and Ajax promptly
complicate), one can generally think of a web application as "one
page turn == one transaction". But even that might not be true---you
could take 10 page turns to submit an order, and the order must be
atomic---that is, you either order the thing with 10 pages of
specifications, or you don't order it. A half-order is a corrupt
order.
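To make the atomicity point concrete, here is a minimal sketch in
Elephant terms, assuming a hypothetical persistent order class and a
hypothetical add-specification helper; either all ten pages of
specifications are committed or none are.

(defun submit-order (customer specifications)
  (with-transaction ()
    (let ((order (make-instance 'order :customer customer)))
      ;; If anything in the body signals an error, the whole
      ;; transaction aborts and no partial order is written.
      (dolist (spec specifications)
        (add-specification order spec))
      order)))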
I agree with you that, in general, when dealing with web
applications involving multiple clients and servers, you have to
have concurrency control. How much you have to provide in your own
application versus how much the "database framework" offers is, in
my opinion, a good question.
Making reference to the Allegro document, it says "In AllegroCache
a program opens a connection to a database and that connection can
only be used by one thread at a time." Then, as you read the
document and focus on their client-server model, they present
sample code that uses "thread-safe" connection pools, with a macro
named with-client-connection. "This macro
retrieves a connection from the pool. If no connection is
available it will create a new connection but it
will create no more than *connection count* connections. If all
connections are created and a new connection is needed the
requesting thread will be put to sleep until a connection is
returned to the pool."
The macro is not the problem, since I could "think of" this macro
as something like Elephant's with-transaction. The problem, and the
overhead I was referring to in my original post, is that to perform
a basic operation such as updating a hash-table value, they write
the function like this:
(defun set-password-for-pool (user new-password)
  (with-client-connection poolobj
    (with-transaction-restart nil
      (setf (map-value (or (poolobj-map poolobj)
                           (setf (poolobj-map poolobj)
                                 (retrieve-from-index 'ac-map
                                                      'ac-map-name
                                                      "password")))
                       user)
            new-password)
      (commit))))
As you can see, there is some possibly unnecessary overhead in the
fact that you are getting a connection from the pool and then
obtaining a handle to the "password" hash table before anything can
be set. The reason they do this, as I understand it, is that each
connection handle works independently in each thread, so each
connection has to maintain a separate handle to each persistent
object class; their solution involves storing in the poolobj
structure a handle to the connection and a handle to the hash table.
So, if this were a more complex application, involving n persistent
classes with m persistent attributes per class, the overhead of
writing all this is significant. Assuming we follow the Elephant
recommendation in section 2.9.3, where actions should be reduced to
minimal operations and nested within with-transaction/
ensure-transaction, I would have to write, potentially, 2*n*m defuns
(getter/setter) for all the attributes, with all the code to fetch
and cache the handles to the connection and to the respective n
persistent classes.
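For contrast, a minimal sketch of what the Elephant-style equivalent
might look like, assuming an open *store-controller*; the user class
and its accessors are hypothetical, and no per-connection handles are
needed.

(defpclass user ()
  ((name :accessor user-name :initarg :name :index t)
   (password :accessor user-password :initarg :password)))

(defun set-password (user new-password)
  ;; ensure-transaction joins an enclosing transaction if one is
  ;; active, otherwise it starts (and commits) its own.
  (ensure-transaction ()
    (setf (user-password user) new-password)))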
Elephant has the "with-transaction" macro. This is really the best
way to use Elephant in most circumstances --- but even then, if you
are keeping ANYTHING in memory, whether to cache it for speed or
because you don't want to put it in the database (session-based
information would be a typical example), you may still have to
"serialize" access to that state. That unavoidably means that you
have to understand some sort of mutual exclusion lock; unfortunately,
these are not 100% standard across different lisps. However, most
lisps do provide a basic "with-mutex" facility. I use this in the DCM
code (in the contrib directory) to serialize access to a "director",
which is very similar to a version of Ian's persistent classes, but
based on a "keep it all in memory and push writes to the DB" model
(that is, Prevalence).
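A minimal sketch of that kind of serialization, assuming
bordeaux-threads for a portable lock; the in-memory session cache and
its accessors are hypothetical.

(defvar *session-cache* (make-hash-table :test 'equal))
(defvar *session-lock* (bt:make-lock "session-cache"))

(defun cached-session (id)
  ;; Serialize all access to the shared in-memory table.
  (bt:with-lock-held (*session-lock*)
    (gethash id *session-cache*)))

(defun (setf cached-session) (session id)
  (bt:with-lock-held (*session-lock*)
    (setf (gethash id *session-cache*) session)))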
The idea I have is to rely on the persistent data instead of
in-memory data. Once I get this going, I may decide to improve
performance with in-memory caches, or anything else. I just want to
get the concept going in a stable and scalable form.
If you will forgive me over-simplifying things, if:
1) You can legitimately think of every page turn as a transaction, and
2) You keep all of the meaningful state in the Elephant DB, and
3) You wrap your basic "process-a-page" function in "with-transaction",
then you won't have a concurrency control problem.
That is a completely appropriate style for certain relatively simple
web-applications; for other kinds of web-applications it would be
very
constraining and slow --- but one should never optimize for speed
before
it is necessary.
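A minimal sketch of rule (3) above, assuming a hypothetical
render-page function and an open store; each page turn runs as a
single transaction.

(defun process-a-page (request)
  ;; All reads and writes done while rendering this page commit
  ;; atomically; on conflict, with-transaction retries the body.
  (with-transaction ()
    (render-page request)))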
I don't mind the over-simplification as long as I understand it :),
and I do. However, thinking back to the AllegroCache document, from
what I understood, they basically take a handle to the connection,
perform the operation, and then release the connection. If this were
a multi-page web operation, it seems that their recommendation would
not be the most appropriate, IMHO, but then again, I don't know.
Connections and handles are completely different in Elephant; the
acache docs are not helpful here.
In your recommendation, if I had an order entry system with multiple
pages to be completed before committing the order, I could
understand wrapping the whole thing with with-transaction. However,
wouldn't that present a possible problem, locking resources and
leaving it up to the human user to complete the process before
committing or rolling back the transaction?
There are lots of ways to think about this.
One is that you keep track of the ongoing session using in-memory
objects unique to the session. When you need to manipulate a
database (to submit an order, a blog entry, etc) the handler for the
'submit' action uses with-transaction to take the data from the in-
memory session object and commit it to the database (an entry in a
per-user btree, adding a new instance to a class, etc).
If you need session history or want to maintain ongoing state, make
this session object a persistent object instead. Then each POST or
GET action in the session is logged so you can recover if the user
goes away for a while, or if there is a server error. You will
eventually fill up your disk with sessions (in the absence of GC), so
you need to either drop the session objects when you are done with
them or use a separate store for session objects and periodically
delete and recreate it. We still need a clean model for online GC of
persistent objects to avoid explicit reclamation.
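A minimal sketch of such a persistent session object, assuming
Elephant's defpclass and an open *store-controller*; the class and
slot names are hypothetical.

(defpclass web-session ()
  ((session-id :accessor session-id :initarg :session-id :index t)
   (last-action :accessor last-action :initform nil)
   (form-data :accessor form-data :initform nil)))

(defun record-action (session action data)
  ;; Log each POST/GET against the session so it can be recovered
  ;; after the user goes away or the server restarts.
  (ensure-transaction ()
    (setf (last-action session) action
          (form-data session) data)))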
As for contention, with-transaction will retry the transaction code,
so if you have a POST handler you can do something like:
(defun handle-post-1 ()
  (with-post-data
    (send-response-page
     (with-transaction ()
       <copy post data to persistent objects>
       <return response persistent object>))))
This way the update can robustly handle contention while the user
only sees the final page that results from the update to the
persistent object for that user/session. If there is a real problem
and the process fails, you can wrap (send-response-page ...) with a
'handler-case' form that sends a server error page with a link to
restart the transaction (perhaps with the session object so the form
entries are properly initialized on the retry) if the transaction
cannot be committed.
Failing transactions signal 'transaction-retry-count-exceeded.
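A minimal sketch of that handler-case wrapping, reusing the
hypothetical web-session class sketched earlier; send-error-page is
also hypothetical, while send-response-page comes from the example
above.

(defun handle-post-2 (session post-data)
  (handler-case
      (send-response-page
       (with-transaction ()
         ;; Copy the post data into the persistent session object and
         ;; return it so the response reflects the committed state.
         (setf (form-data session) post-data)
         session))
    (transaction-retry-count-exceeded ()
      (send-error-page "The update could not be committed; please try again."))))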
If you are using BDB, make sure that db_deadlock is running, either
via the :deadlock-detect keyword option or by running it as an
external process (if using multiple lisp processes).
As a user of Elephant, you really shouldn't have to worry too much
about threading so long as you follow the simple rules laid out in
the manual under "multi-threading". I think you are trying to
understand how we make this possible since it seems harder from your
read of the acache interface.
You may be right. However, thinking more about this whole thing, and
from my understanding of Elephant and what I understood of
AllegroCache, I may be trying to compare apples and oranges. They
may be similar systems, but I don't know if it does justice to
compare Elephant with AllegroCache's client-server model. If I
understand it correctly (now), the current implementation of
Elephant is more similar to AllegroCache's standalone
(non-client-server) model. So each web process that accesses
Elephant can do so seamlessly with the standard *store-controller*
(assuming a single store controller) and not have to deal with
managing connection pools and all that. Keeping this in mind, I
would also assume that in Elephant I don't have to keep a handle to
each persistent class for each connection. Maybe this is what
confused me, and maybe I shouldn't be reading AllegroCache's
documentation :)
Correct and correct. We implement the physical storage persistent
classes _very_ differently than acache and trying to compare the
system implications of using each is likely to be more confusing than
helpful. Don't think of them as the same kind of system, they are
two different systems optimizing different aspects of the common
problem of implementing persistent classes.
A simple conceptual model is that each thread has its own
transaction. If these transactions are executing concurrently, the
BDB library or SQL server logs the read dependencies and the
side-effecting writes until a commit or abort. On abort, you throw
away the log. On a commit, one transaction at a time writes its
side effects and cancels any transaction that has read or written one
of the entries written by the committing transaction.
Thread isolation is guaranteed by a policy for using the BDB library
or SQL server. Calls to these libraries are re-entrant. The current
transaction ID (only used by one thread) determines where the reads
and writes are logged (this is a little more complex for SQL, but
handled by the data store and transparent to the user).
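A toy sketch of that "one transaction per thread" model, assuming
bordeaux-threads and an open store; the root key "counter" is
hypothetical.

(defun increment-counter ()
  ;; Each call runs in its own transaction; conflicting updates are
  ;; detected at commit time and the losing transaction is retried.
  (with-transaction ()
    (let ((n (or (get-from-root "counter") 0)))
      (add-to-root "counter" (1+ n)))))

(defun bump-counter-concurrently ()
  (let ((threads (list (bt:make-thread #'increment-counter)
                       (bt:make-thread #'increment-counter))))
    (mapc #'bt:join-thread threads)
    (get-from-root "counter")))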
I guess this goes back to what I just commented on: the fact that
each web thread/request will use the connection already in place in
the Lisp VM and not have to deal with establishing a new connection
(I could have checks to make sure that if the store controller is
not opened, I open it, but once it's opened, I "shouldn't" have to
worry too much about it). Right?
Correct. Elephant maintains a connection to a BDB session which
maintains an open file of the underlying logs and database files.
This is shared among threads because BDB is re-entrant and
transaction ids are used to provide isolation in the presence of
concurrency.
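A minimal sketch of that open-once pattern, assuming the BDB data
store; the path is hypothetical.

(defvar *my-store* nil)

(defun ensure-store-open ()
  ;; Open the store once at server startup; request threads then share
  ;; the default *store-controller* that open-store sets up.
  (unless *my-store*
    (setf *my-store* (open-store '(:BDB "/var/data/elephant/")))))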
The only other thing we need to do is make sure that the parts of
elephant shared across threads are themselves protected by Lisp
locks. Most of this is the store-controller and some data structures
used to initialize a new store controller.
As an end-user application developer, do I need to worry about this
or should I expect the Elephant framework to handle it?
Elephant handles it, sorry if I confused the issue but I thought you
were trying to understand how elephant implements thread safety.
If you stick to the user contract in the documentation, you shouldn't
have to worry further about interactions of multiple threads (other
than the usual considerations if you have shared lisp variables
instead of shared persistent objects).
I would assume you are referring to my own application shared
variables and not Elephant-related variables, right?
Yes
I think that SQL databases are a safer bet than Berkeley DB
for having several processes on different machines talking to the
same
store, so I will have one instance of postgresql running on a server
with scsi raid 10 and lots of ram.
Henrik, would you mind elaborating more on this? Why would SQL
databases be safer than the BDB stores? I know they are handled by
separate processes, potentially, on separate machines, so in
essence, they are independent of your application. However, isn't
BDB designed just to tackle that using an application library
instead of a separate process?
The problem he is trying to solve is scaling computational power by
using multiple CPUs and multiple servers. This is doable with
Elephant so long as each independent lisp image is using the same
data store spec. However, if you have two machines, Berkeley DB in
its normal mode will not work correctly, as its locking facilities
require shared memory between all processes sharing a given disk
database. So the multiple-CPU problem is solved by using N lisp
processes for N CPUs with shared memory. However, the multiple-server
problem requires a common server that all web servers can talk to.
This is easier to set up with SQL than to write your own server on
top of BDB.
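For the multiple-server case, a minimal sketch of pointing several
lisp images at one central Postgres instance; the host, database
name, and credentials are hypothetical, and the exact spec format
should be checked against the CL-SQL data store documentation.

(defvar *shared-store* nil)

(defun connect-to-shared-store ()
  ;; Every web server (on any machine) opens the same spec, so they
  ;; all share the central PostgreSQL database.
  (setf *shared-store*
        (open-store '(:CLSQL (:POSTGRESQL "db.example.com" "appdb"
                              "appuser" "secret")))))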
Overall, this being my first experience with Lisp and ODBs, I really
like Elephant. After reading some of AllegroCache's documentation, I
would still prefer using Elephant. Maybe I'm trying to see deeper
than I need to. Maybe I just need to see more samples of real-world
applications. I would love to contribute sample applications to the
project so as to make it clearer and easier for others to learn, but
I guess I have to learn it myself first. Code-wise, I think I have
grasped the whole thing. However, since I currently have no ability
to test anything at a larger scale, I'm trying to understand what it
would take for an application that uses Elephant to work in a
large-scale system (both hardware and software).
The biggest issue in scaling is when you think your application needs
to be larger than a single server. Elephant is great for single
server applications. When you scale to multiple servers it is
because you are talking about high hundreds to thousands of
concurrent sessions instead of dozens. That kind of traffic likely
requires a highly reliable substrate and I'm not sure Elephant is
sufficiently hardened that I could recommend it for that kind of
use. Unless, of course, you want to pave new ground with it in which
case I think Elephant can get there.
Thanks again for all of your comments. They did in fact help me, and
I am sure your follow-up comments will help me even more. Now, while
you guys digest this and reply to my post, I will go back and read
the updated Elephant manual :)
Thanks,
Daniel
Good luck, when you figure all this out a detailed summary of the
primary things that confused you would be helpful in improving the
documentation.
_______________________________________________
elephant-devel site list
elephant-devel@common-lisp.net
http://common-lisp.net/mailman/listinfo/elephant-devel