Re: [Zope] minimizing conflict errors

Chris McDonough Sun, 20 Nov 2005 11:46:57 -0800


On Nov 20, 2005, at 12:16 PM, Dennis Allison wrote:

The structure of the naviagation method is simple enough.Everything is

wrapped in a <dtml-let> which sets a number of parameters mostly by
reading them from the SESSION (with an interface function) or plucking
them from the relational database with a query.

In the scope of the let is dtml code which, when rendered, providesthe

various navigation links.  In various sections there are additional
<dtml-let> blocks and additional queries to the relational database
and several <dtml-in> loops.

Looking at the code, I don't understand why I am seeing conflicts.
As I understand things, neither variables in the <dtml-let> space nor
the REQUEST/RESPONSE space are stored in the ZODB so modifications to
them don't look like writes to the conflict mechanism.  Am I incorrect
in my understanding?


Yes, but that's understandable.  It's not exactly obvious.

The sessioning machinery is one of the few places in Zope where it'snecessary for the code to do what's known as a "write on read" in theZODB database.

Even if you're just "reading" from a session, looking up a session,or doing anything otherwise related to sessioning, it's possible foryour code to generate a ZODB write.This is why you get conflicts even if you're "just reading"; wheneveryou access the sessioning machinery, you are potentially (but notalways) causing a ZODB write. All writes can potentially cause aconflict error.

While this might sound fantastic, it's pretty much impossible toavoid when using ZODB as a sessioning backend. The sessioningmachinery has been tuned to generate as few conflicts as possible,and you can help it by doing your own timeout, resolution, andhousekeeping tuning as has been suggested. MVCC gets rid of readconflicts. But it's not possible to completely avoid write conflictsunder the current design.

Here's why. The sessioning machinery is composed of three major datastructures:

- an index of "timeslice" to "bucket". A timeslice is an integerrepresenting

  some range of time (the range of time is variable, depending on the

"resolution", but out of the box, it represents 20 seconds).This mapping

  is an IOBTree.

- A "bucket" is a mapping from a browser id to "session dataobject" (aka

  transient object).  This mapping is an OOBTree.

- three "increasers" which mark the "last" timeslice in whichsomething was done

  (called the garbage collector, called the finalizer, etc).

The point of sessioning is to provide a writable namespace assignedto a single user that expires after some period of inactivity by thatuser. To this end, we need to keep track of when the last time theuser "accessed" the session was. This is the point of the index.

When a user accesses his session, we may need to move his sessiondata object (identified by his browser id) from one bucket(representing an older timeslice) to another (representing a newertimeslice). This needs to happen *even if your code doesn't writeanything to his session*, because it represents a session access, andthe session is defined by total inactivity (not just writeinactivity). Likewise, when a user runs code that requires access toa session, but that user does not yet have a session data object, awrite may need to occur. So seemingly innocuous accesses to sessiondata can cause a write. Consider, in a Python script:


req = context.REQUEST
REQUEST.SESSION

Looks pretty harmless and unlikely to cause a write. However, that'snot true. If the "bucket" in which the user's session data object isfound is not associated with the "current" timeslice, we need to movehis data object to the bucket that *is* associated with the currenttimeslice, which is a write operation in order to make note of thefact that his session is now "current".


Likewise with:

req = context.REQUEST
a = REQUEST.SESSION.get('foo')

Even though this appears to be "only a read", the sessioningmachinery itself may need to perform a write operation to move theuser's data object to the current bucket.

Jacking up the resolution time increases the period of timerepresented by a single timeslice, so fewer total writes need to beperformed to keep a session "current". Turning on "externalhousekeeping" doesn't prevent this normal movement of data objectsbetween buckets, it just causes another process that cleans up"stale" data from happening during normal sessioning operations.

The sessioning machinery attempts to minimize conflicts. The 2.8version of the temporarystorage does MVCC, which essentiallyeliminates read conflict errors. The transience machinery includessignificantly complicated logic to attempt to prevent conflict errorsfrom occurring including code that attempts to prevent two threadsfrom doing housekeeping at once as well as application level conflictresolution for simultaneous writes to the same session data object.However, the machinery uses BTrees to hold indexes. BTrees also havea limited number of conflict avoidance strategies, but under certaincircumstances (a "bucket split" is the canonical case) it cannot beavoided so not all write conflicts can be prevented without using adifferent kind of data structure to hold sessioning data.

A more detailed description of how "transience" works is availablewithin the file named "HowTransienceWorks.txt" in the Products/Transience package within Zope in case you're interested.

I hope this explains why you see conflict errors even if your code"doesn't do any writes", because actually it probably does by virtueof accessing a session. Tuning the knobs that come with themachinery helps. Causing transactions to be as short as possiblealso helps (by not using ZEO to back the sessioning database or bymaking your code just generally faster) because then there is less ofa chance of a conflicting change.


- C

_______________________________________________
Zope maillist  -  [email protected]
http://mail.zope.org/mailman/listinfo/zope
**   No cross posts or HTML encoding!  **

(Related lists -http://mail.zope.org/mailman/listinfo/zope-announce

http://mail.zope.org/mailman/listinfo/zope-dev )

Re: [Zope] minimizing conflict errors

Reply via email to