Re: Comparison Metrics for OpenJPA's ConcurrentHashMap

David Ezzio (asmtp) Wed, 30 May 2007 13:04:43 -0700

Hi Marc,

I did plug it in, but it failed straightaway on a security issue. I should probably read its documentation. :) I'll try it again along with the backport lib done by Emory U.


David


Marc Prud'hommeaux wrote:

David-

That is very interesting.
Did you also take a look at the one at http://sourceforge.net/projects/high-scale-lib ? They say its performance only shines for high thread/cpu counts, but it might be interesting to see where its numbers lie in the range.
On May 29, 2007, at 11:01 AM, David Ezzio (asmtp) wrote:
Recently, I did some testing of Map implementations under concurrency.
My primary purpose was to verify the reliability of OpenJPA's ConcurrentHashMap implementation. As I got into it, I saw the opportunity to get some performance metrics out of the test.
The biggest part of my task was coming up with a reliable and useful testing framework. I designed it with the following two factors in mind: First, I wanted to test the edge conditions where an entry had just been added or removed or where a key's value had just been updated. The idea is that a number of threads add, remove, and update entries, while other threads check to see if these recent modifications are visible (or in the case of removals, not visible). Second, I wanted the testing framework itself to be free of synchronization. If the testing framework used synchronization then it would tend to serialize the readers and writers and thereby mask concurrency issues in the map implementation under test.
The testing framework uses a non-synchronizing, non-blocking FIFO queue as the mechanism for the writing threads to communicate their recent modifications to the reading threads.
To prevent writing threads from overwriting recent modifications before they could be read and verified, the testing framework walks the hash map keys in a linear (or in the case of updates, circular) order. By using a hash map with a large enough capacity, readers have the time to verify the recent modifications before the writer threads come back to modify that part of the key space again.
Using an adapter for the map implementation, the testing framework starts five writer threads and ten reader threads at the same time. These threads run wide open for 30 seconds, except that the readers will give up their time slice if they find nothing on the queue. The HashMaps were all sized for the needed capacity upon creation, so no resizing occurred during testing.
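The writer/reader scheme described above can be sketched roughly as follows. This is not the original test code: the class and method names are hypothetical, only additions are checked (removals and updates are left out), and java.util.concurrent.ConcurrentLinkedQueue stands in for the non-blocking FIFO queue, whose actual implementation is not shown in this message.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical sketch of the harness idea; names are illustrative only.
final class VisibilityHarness {

    /** Runs writers and readers against the map; returns the number of failed visibility checks. */
    static long countFailures(Map<Integer, Integer> map, int writers, int readers,
                              int keysPerWriter, long millis) {
        // Non-blocking FIFO used by writers to announce recent additions (assumed stand-in).
        Queue<int[]> recent = new ConcurrentLinkedQueue<int[]>();
        AtomicLong failures = new AtomicLong();
        long deadline = System.nanoTime() + millis * 1000000L;
        List<Thread> threads = new ArrayList<Thread>();

        for (int w = 0; w < writers; w++) {
            final int id = w;
            threads.add(new Thread(() -> {
                // Each writer walks its own key range linearly, so no key is rewritten.
                for (int i = 0; i < keysPerWriter && deadline - System.nanoTime() > 0; i++) {
                    int key = id * keysPerWriter + i;
                    map.put(key, i);
                    recent.offer(new int[] {key, i});
                }
            }));
        }
        for (int r = 0; r < readers; r++) {
            threads.add(new Thread(() -> {
                while (deadline - System.nanoTime() > 0) {
                    int[] added = recent.poll();
                    if (added == null) {
                        Thread.yield(); // nothing queued: give up the time slice
                        continue;
                    }
                    Integer seen = map.get(added[0]);
                    if (seen == null || seen != added[1]) {
                        failures.incrementAndGet(); // recent addition was not visible
                    }
                }
            }));
        }
        for (Thread t : threads) t.start();
        try {
            for (Thread t : threads) t.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return failures.get();
    }
}
```

With a map that handles concurrency correctly, for example countFailures(new java.util.concurrent.ConcurrentHashMap<Integer, Integer>(4096), 5, 10, 1000, 200), the returned count should be zero. Passing an unsynchronized HashMap is the case expected to misbehave.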
I got some interesting results.
Four implementations were tested: Java's unsynchronized HashMap implementation, Java's synchronized HashMap implementation, Java's ConcurrentHashMap implementation, and OpenJPA's ConcurrentHashMap implementation.
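As an illustration only, the JDK-based candidates could be created pre-sized along these lines. The class and method names are hypothetical, the sizing formula assumes HashMap's default load factor of 0.75, and OpenJPA's map is left out so the sketch needs nothing beyond the JDK.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical helper; not part of the original test framework.
final class MapCandidates {

    /** Initial capacity that avoids a resize at the default 0.75 load factor (assumption). */
    static int initialCapacityFor(int expectedEntries) {
        return (int) (expectedEntries / 0.75f) + 1;
    }

    /** Builds the JDK-only implementations under test, keyed by a short label. */
    static Map<String, Map<Integer, Integer>> create(int expectedEntries) {
        int capacity = initialCapacityFor(expectedEntries);
        Map<String, Map<Integer, Integer>> candidates =
                new LinkedHashMap<String, Map<Integer, Integer>>();
        candidates.put("unsynchronized", new HashMap<Integer, Integer>(capacity));
        candidates.put("synchronized",
                Collections.synchronizedMap(new HashMap<Integer, Integer>(capacity)));
        candidates.put("concurrent", new ConcurrentHashMap<Integer, Integer>(capacity));
        // org.apache.openjpa.lib.util.concurrent.ConcurrentHashMap would be added here
        // when the OpenJPA library is on the classpath.
        return candidates;
    }
}
```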
Only Java's unsynchronized HashMap failed, as expected, under test. Under test, this implementation demonstrates its inability to handle concurrency. The other three implementations worked flawlessly under test.
The java.util.concurrent.ConcurrentHashMap implementation (available with Java 5 and 6) was the fastest implementation tested.
Java's synchronized wrapper for the HashMap implementation is one to two orders of magnitude slower than Java's ConcurrentHashMap implementation.
OpenJPA's ConcurrentHashMap compares equally with Java's ConcurrentHashMap in find operations and is 2-4 times slower in mutating operations.
Implementation   Add   Remove   Update  Find-a  Find-r  Find-u
---------------+------+-------+--------+-------+-------+------
synchronized     103     35       50      40      37     54
concurrent      13.2    6.4      6.1     0.6     0.3    1.1
OpenJPA         29.8   26.6     27.9     0.6     0.6    0.6


Legend:

synchronized: java.util.Collections.synchronizedMap(new java.util.HashMap())
concurrent: java.util.concurrent.ConcurrentHashMap
OpenJPA: org.apache.openjpa.lib.util.concurrent.ConcurrentHashMap

Add: time for average add operation
Remove: time for average remove operation
Update: time for average update of new value for existing key
Find-a: time to find a recent addition
Find-r: time to NOT find a recent removal
Find-u: time to find a recent update
These times (in microseconds) are representative, but are not the average of several runs. The tests were run on a Dell Dual Core laptop under Windows. The performance meter was pegged during the tests.
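For anyone who wants the per-operation slowdown factors, they can be derived from the table with a few lines. The sketch below only divides the figures listed above; the class and method names are hypothetical.

```java
// Hypothetical helper that computes slowdown ratios from the table above.
final class TimingRatios {
    // Column order: Add, Remove, Update, Find-a, Find-r, Find-u (microseconds).
    static final String[] OPERATIONS = {"Add", "Remove", "Update", "Find-a", "Find-r", "Find-u"};
    static final double[] SYNCHRONIZED = {103, 35, 50, 40, 37, 54};
    static final double[] CONCURRENT = {13.2, 6.4, 6.1, 0.6, 0.3, 1.1};
    static final double[] OPENJPA = {29.8, 26.6, 27.9, 0.6, 0.6, 0.6};

    /** Element-wise ratio of one implementation's times to a baseline's times. */
    static double[] ratios(double[] times, double[] baseline) {
        double[] result = new double[times.length];
        for (int i = 0; i < times.length; i++) {
            result[i] = times[i] / baseline[i];
        }
        return result;
    }

    public static void main(String[] args) {
        double[] sync = ratios(SYNCHRONIZED, CONCURRENT);
        double[] openJpa = ratios(OPENJPA, CONCURRENT);
        for (int i = 0; i < OPERATIONS.length; i++) {
            System.out.printf("%-7s synchronized x%.1f, OpenJPA x%.1f%n",
                    OPERATIONS[i], sync[i], openJpa[i]);
        }
    }
}
```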
David Ezzio
