About index and atomic operations

Emmanuel Lecharny Tue, 25 May 2010 04:41:41 -0700

Hi guys,

as I'm currently reviewing the JDBM code, I think we might get betterperformance if we use another solution than brutal synchronization (alsothe current implementation is not 100% threadsafe). Right now, theAbstractStore is the class responsible for accessing the entries, andupdate the DIT. As we must provide operation atomicity, we havesynchronized those methods :

- add(entry)
- delete(DN)
- modify()
- move()
- rename()

All the other methods aren't synchronized, so for instance callinggetUserIndex() does not guarantee that the returned index is protectedagainst concurrent modifications.

In fact, we must see an operation as atomic, which means no other threadcan access the modified index and data while the operation is notcompletely done.


JDBM does not offer such a level of protection.

In order to guarantee atomic operations, we should instead implement asystem based on MVCC (Multiple Version Concurrent Control), where allthe reads are done expecting that the data hasn't been modified sincethe operation started (avoiding costly locks to be used), and a twosteps modification :- first locally modify the data structure (nothing is locked until allthe modified elements are updated in memory). That means we read fromthe disk and store in memory the necessary elements for this operation.- then when done, acquire a global lock and update the data structurefor all the modified elements

Doing so will allow us to spare a lot of contention, as we only updatethe modified elements (ie, if we add an entry, the associated BTreesindex won't necessary be updated from root to leaves. Usually, only oneleaf is modified), restricting the time necessary to do the commit tothe minimum.

In any case, read are not synchronized, we just return what is presentin the backend using the latest version. Obviously, we can get an errorif for instance we are reading all the children of an entry, and if thisentry is deleted since the operation started, but that's not a problem:there is no guarantee in LDAP that a returned resut is stil present inthe DIT.

In order to do that, we will associate a version to each element westore, so we just have to compare the element version with the latestversion for this element.

Ok, this is a rough description of the whole mechanism, but that shouldwork well.


--
Regards,
Cordialement,
Emmanuel Lécharny
www.nextury.com

About index and atomic operations

Reply via email to