Re: [DBCP] how does narrowing the lock help with deadlock in DBCP-270?

Phil Steitz Sat, 28 Jun 2008 00:15:03 -0700

sebb wrote:

On 27/06/2008, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:

Author: psteitz
 Date: Thu Jun 26 19:48:06 2008
 New Revision: 672097


 URL: http://svn.apache.org/viewvc?rev=672097&view=rev
 Log:
 Narrowed synchronization in AbandonedTrace to resolve an Evictor deadlock.
 JIRA: DBCP-270
 Reported and patched by Filip Hanik.


I've had a look at the patch and the JIRA, and I don't understand how
narrowing the lock works here.

Originally, the code locked on "this", whereas now the code uses "this.trace".

However, as far as I can see, no objects were taken out of the locked
blocks, and there is no other synchronization in the class. So what
else was locking on "this"?

So how does the change of lock object actually help here?

Or is it just a byproduct of the synchronisation that was added to
getTrace() as part of the patch?

The lock details provided in the JIRA don't show any locks on the
AbandonedTrace object - but this info could have been omitted.

I'm not saying that the patch is wrong, but I would like to understand
how it helps!

Good questions and thanks for reviewing. Even greater thanks for jumping in to DBCP and/or POOL :)


Now to the inscrutable world of DBCP...

PoolableConnection extends DelegatingConnection, which extends AbandonedTrace. The deadlock sequence in DBCP-270 is:

1) A DBCP client thread invokes close on a PoolableConnection. This method is synchronized, so it locks the PC.

2) Close for a PC means return to pool, so the associated GenericObjectPool's returnObject is invoked. This calls GOP.addObjectToPool. Neither of these methods is synchronized, but addObjectToPool includes two synchronized blocks, both of which lock the pool. The client thread makes it past the first block, which returns the connection to the pool. Before it can acquire the pool lock for the second block (to decrement the number of active connections), step 3 intervenes.

3) The (dreaded) Evictor kicks off, locks the pool and attempts to validate the connection that was just returned. Validation involves executing a query. Due to another painful-to-maintain feature of DBCP, when a DelegatingConnection creates a statement, it registers itself as the source of the statement by passing a reference to itself to the DelegatingStatement that its createStatement creates. The way this actually works is that the DelegatingStatement constructor ends up adding the DelegatingStatement to the trace property of the DelegatingConnection. This happens via the DelegatingConnection's addTrace method (from AbandonedTrace). Before the patch, addTrace protected the trace in a block synchronized on "this", locking the DelegatingConnection. This led to deadlock: the Evictor holds the pool lock and waits for the connection lock that the client thread acquired in 1), while the client thread waits for the pool lock that the Evictor holds.

Reducing the lock scope to the trace eliminates this contention. A workaround is to turn off testWhileIdle, or avoid using the Evictor altogether.
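To make the effect of the lock change concrete, here is a minimal sketch. It is not the DBCP source: WideLockTrace, NarrowLockTrace and LockDemo are hypothetical stand-ins for AbandonedTrace before and after the patch, and the helper only shows whether an "evictor" thread can finish addTrace while another thread holds the object's own monitor (as the client thread does inside the synchronized close). It shows the blocking, not the full two-lock deadlock.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical demo harness, not part of DBCP.
class LockDemo {
    // Holds owner's monitor (playing the client thread inside synchronized close())
    // and reports whether `work` finished on another thread within the timeout.
    static boolean completesWhileOwnerLocked(Object owner, Runnable work, long timeoutMillis) {
        Thread evictor = new Thread(work, "evictor");
        evictor.setDaemon(true); // a blocked thread must not keep the JVM alive
        synchronized (owner) {
            evictor.start();
            try {
                evictor.join(timeoutMillis); // waits on the Thread, owner stays locked
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
            return !evictor.isAlive();
        }
    }

    public static void main(String[] args) {
        WideLockTrace wide = new WideLockTrace();
        NarrowLockTrace narrow = new NarrowLockTrace();
        System.out.println("wide lock finished:   "
                + completesWhileOwnerLocked(wide, () -> wide.addTrace("stmt"), 500));
        System.out.println("narrow lock finished: "
                + completesWhileOwnerLocked(narrow, () -> narrow.addTrace("stmt"), 500));
    }
}

// Pre-patch style (simplified assumption): the list is guarded by "this".
class WideLockTrace {
    private final List<Object> trace = new ArrayList<>();

    void addTrace(Object o) {
        synchronized (this) { // contends with anyone who locks the whole object
            trace.add(o);
        }
    }
}

// Post-patch style (simplified assumption): the list is guarded by itself.
class NarrowLockTrace {
    private final List<Object> trace = new ArrayList<>();

    void addTrace(Object o) {
        synchronized (trace) { // independent of the object's own monitor
            trace.add(o);
        }
    }

    int size() {
        synchronized (trace) {
            return trace.size();
        }
    }
}
```

With the wide lock the evictor thread stays blocked until the owner's monitor is released. With the narrow lock it completes straight away, because nothing else synchronizes on the list.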

This is a good change for DBCP, which makes progress toward addressing the basic problem of client and evictor contention for locks. There are others in DBCP-44. The deadlock reported in DBCP-270 is the same as the last one mentioned in DBCP-44, which is new with POOL 1.4, where the synchronization in addObjectToPool was broken into two blocks. That change was made to a) keep factory methods out of pool-synchronized scope (which was killing performance) and b) wait to decrement activeCount until objects that are destroyed on return have actually been destroyed. The general problem of preventing Evictor access to objects being borrowed or returned is documented in POOL-125. I have a patch for this that I am testing.
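The window opened by splitting addObjectToPool into two blocks can be sketched roughly as follows. This is a hypothetical miniature pool, not the POOL 1.4 source: SketchPool, PoolWindowDemo and the betweenBlocks hook are invented for illustration, and the hook exists only to make the gap between the two synchronized blocks observable.

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical miniature pool; names and structure are illustrative only.
class SketchPool {
    private final Deque<Object> idle = new ArrayDeque<>();
    private int numActive;

    synchronized Object borrowObject() {
        numActive++;
        return idle.isEmpty() ? new Object() : idle.removeFirst();
    }

    // betweenBlocks is a test hook (an assumption of this sketch) that runs
    // in the gap where the pool lock is not held.
    void addObjectToPool(Object obj, Runnable betweenBlocks) {
        synchronized (this) {   // block 1: the object becomes visible as idle
            idle.addLast(obj);
        }
        betweenBlocks.run();    // window: any other thread can lock the pool here
        synchronized (this) {   // block 2: the active count catches up
            numActive--;
        }
    }

    synchronized int getNumIdle() { return idle.size(); }

    synchronized int getNumActive() { return numActive; }
}

class PoolWindowDemo {
    // Returns {numIdle, numActive} as seen by a second thread inside the window.
    static int[] observeWindow() {
        SketchPool pool = new SketchPool();
        Object conn = pool.borrowObject();
        int[] seen = new int[2];
        pool.addObjectToPool(conn, () -> {
            Thread evictor = new Thread(() -> {
                seen[0] = pool.getNumIdle();   // acquires the pool lock
                seen[1] = pool.getNumActive();
            }, "evictor");
            evictor.start();
            try {
                evictor.join(); // join makes the evictor's writes visible here
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        return seen;
    }

    public static void main(String[] args) {
        int[] seen = observeWindow();
        System.out.println("idle=" + seen[0] + " active=" + seen[1]); // prints "idle=1 active=1"
    }
}
```

In the window the returned object already counts as idle, so an evictor-like thread can lock the pool and reach it, while the returning thread has not finished its bookkeeping. That is the point at which the Evictor gets in during the DBCP-270 sequence.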

Thanks in advance for any ideas, patches or RM-volunteering to help get DBCP 1.3 and/or POOL 1.5 out. We should cut a DBCP release as soon as we can organize it to fix this and the several other issues that have been addressed in trunk.

Phil


