Dénes Bodó created OPENJPA-2919: ----------------------------------- Summary: Connection pool can be exhausted when connections are killed on the DB side Key: OPENJPA-2919 URL: https://issues.apache.org/jira/browse/OPENJPA-2919 Project: OpenJPA Issue Type: Bug Components: jdbc Affects Versions: 3.2.2 Reporter: Dénes Bodó
Apache Oozie 5.2.1 uses OpenJPA 2.4.2 and commons-dbcp 1.4 and commons-pool 1.5.4. These are ancient versions, I know. h1. Description The issue is that when due to some network issues or "maintenance work" on the DB side (especially PostgreSQL) which causes the DB connection to be closed, it results exhausted Pool on the client side. Many threads are waiting at this point: {noformat} "pool-2-thread-4" #20 prio=5 os_prio=31 tid=0x00007faf7903b800 nid=0x8603 waiting on condition [0x000000030f3e7000] java.lang.Thread.State: WAITING (parking) at sun.misc.Unsafe.park(Native Method) - parking to wait for <0x000000066aca8e70> (a java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject) at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175) at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:2039) at org.apache.commons.pool2.impl.LinkedBlockingDeque.takeFirst(LinkedBlockingDeque.java:1324) {noformat} According to my observation this is because the JDBC driver does not get closed on the client side, nor the abstract DBCP connection _org.apache.commons.dbcp2.PoolableConnection_ . h1. Repro (Un)Fortunately I can reproduce the issue using the latest and greatest commons-dbcp 2.11.0 and commons-pool 2.12.0 along with OpenJPA 3.2.2. I've just created a Java application to reproduce the issue: [https://github.com/dionusos/pool_exhausted_repro] . See README.md for detailed repro steps. h1. What we tried so far I got in touch with DBCP team who confirmed that in case of an error in the connection the client (in this case OpenJPA is the client of DBCP) should handle the exception like closing the connection: DBCP-595. I agree with them as based on the investigation I did I can also confirm that DBCP is really robust when the client releases the broken connection object after catching SQLException. Please check the 4 comments on DBCP-595 for extra details. h1. Ask OpenJPA team! * Could you please confirm that my findings are valid? * Did I do anything wrong in my repro program? * Oozie has retry logic implemented: [https://github.com/apache/oozie/blob/318fac5/core/src/main/java/org/apache/oozie/service/JPAService.java#L397L427] but this cannot avoid the reported dead lock. * Do you have any questions I can answer to help in the investigation? -- This message was sent by Atlassian Jira (v8.20.10#820010)