On May 18, 2008, at 6:05 AM, Alex Mizrahi wrote:

IE> If you use the discipline of using with-caching before the first read
IE> of a cached instance's slot inside with-transaction, then you get the
IE> following cool behavior in a multi-threaded scenario:
IE> - txn one reads the cached values (setting read locks)
IE> - txn one does various ops in memory
IE> - txn one then commits the instance values (grabbing write locks)

IE> - txn two is running in parallel and does a refresh, grabbing read locks
IE> - txn two commits after txn one and is aborted and restarted.

IE> In this mode of operation, I believe that we can guarantee that the
IE> ACID properties are maintained at transaction boundaries.

How is that? Both txn one and txn two grab read locks, so they don't block each other, right? Then txn one modifies slots, and txn two immediately sees the changes; that breaks isolation, doesn't it? The only way to avoid collisions is to make even read locks exclusive, but that would totally kill parallelism: even read-only transactions working with the same objects would not run concurrently.

You are quite right! It's been a long week, so please bear with me on this. :) I was thinking of my original write-through mode, which would catch these cases because the writes would block the writer (all the read locks having been grabbed at the beginning of the session). You wouldn't want to use caching mode in the presence of a high degree of sharing anyway, so if we grab write locks (i.e. exclusive read locks) just for the cached objects, the concurrency cost wouldn't be too high, and we'd make sure these semi-independent operations were isolated. Any shared objects, especially those rarely written, should be handled in write-through mode.
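
Something like this is what I have in mind; the :objects and :lock arguments to with-caching are strawman syntax, and do-stuff / hit-count / shared-obj are made up for illustration:

  ;; Strawman sketch -- with-caching's argument list is not settled API.
  (with-transaction ()
    ;; Fill the cache for obj-a and obj-b up front, grabbing exclusive
    ;; (write) locks on their records so no concurrent transaction can
    ;; read them while we mutate the cached copies in memory.
    (with-caching (:objects (list obj-a obj-b) :lock :write)
      ;; Slot reads/writes on obj-a and obj-b now hit the in-memory
      ;; cache; the final values are flushed to the store at commit.
      (do-stuff obj-a obj-b)
      ;; A shared, rarely-written object bypasses the cache and goes
      ;; straight through to the database (write-through mode).
      (incf (hit-count shared-obj))))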

I'm sure there are still holes. Would you like to help think through a policy that we could throw into a macro wrapper to make this easier? I'd like to have one or two out-of-the-box use cases for this and leave the rest to the advanced user.

Hmmm... you could also engineer around some of the above tradeoffs by implementing a policy where any thread that will side-effect a set of fully cached objects first has to grab an exclusive lock on a designated slot (essentially using the DB as a semaphore, but ostensibly at a lower cost in time and complexity) prior to the side-effecting operations in memory.
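
A minimal sketch of that policy, assuming Elephant's defpclass and with-transaction; the mutex-token class and with-db-semaphore macro are invented names:

  ;; Invented names throughout; only defpclass/with-transaction are real.
  (defpclass mutex-token ()
    ((owner :accessor token-owner :initform nil)))

  (defmacro with-db-semaphore ((token) &body body)
    ;; Writing TOKEN's slot first means this transaction holds an
    ;; exclusive lock on it until commit, so concurrent writers
    ;; serialize on that single write before touching their caches.
    `(with-transaction ()
       (setf (token-owner ,token) t) ; grab the write lock
       ;; Side-effect the fully cached objects in memory here; any
       ;; other thread entering this macro blocks on the token (or
       ;; deadlocks and is restarted) until we commit.
       ,@body))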

I believe that "the right way" to do such stuff is a thread-/transaction-local cache, i.e. each transaction binds *slot-cache* (e.g. a hash-table), and all slot reads/writes work with it. Slot read/modification performance will be somewhat inferior, but it won't break concurrency.

I agree with you here. However, while they are constant time, most hash ops are rather expensive (hashing was the biggest bottleneck in the serializer), and clearing or allocating hashes on each transaction is really expensive in my experience. If you only had a dozen or so slots, even an alist would be faster... I think the usual tradeoff is that a 20-deep alist search costs about the same as a single hash fetch.
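
For instance, a transaction-local alist cache might look roughly like this; everything here is hypothetical, and persistent-slot-value is a stand-in for whatever low-level reader the data store exposes:

  ;; Rough sketch: *slot-cache* is rebound for each transaction.
  (defvar *slot-cache* nil
    "Alist of ((object . slot-name) . value) for the current txn.")

  (defun cached-slot-read (obj slot)
    (let ((entry (assoc (cons obj slot) *slot-cache* :test #'equal)))
      (if entry
          (cdr entry)
          (let ((value (persistent-slot-value obj slot))) ; direct DB read
            (push (cons (cons obj slot) value) *slot-cache*)
            value))))

  (defun cached-slot-write (obj slot value)
    (let ((entry (assoc (cons obj slot) *slot-cache* :test #'equal)))
      (if entry
          (setf (cdr entry) value)
          (push (cons (cons obj slot) value) *slot-cache*))
      value))
  ;; At commit the transaction would walk *slot-cache* and write the
  ;; entries back (a real version would also track which are dirty).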

IE> - Slot indexing is a problem. Completely cached slots will be out of
IE> sync with their index because you can't keep the index in sync with
IE> the cached slot, so the API is broken (add instance, do query, new
IE> instance not in query result!). This can be fixed with a write-
IE> through mode on the indexed slots.

Partially, this can be fixed by doing the caching at the btree level rather than at the slot level.

Please explain further; I'm not sure I follow you here.

Thank you for the input!
Ian

_______________________________________________
elephant-devel site list
elephant-devel@common-lisp.net
http://common-lisp.net/mailman/listinfo/elephant-devel
