Re: Concurrency issue with EHCacheTokenStore

Alessio Soldano Mon, 19 Sep 2016 01:19:35 -0700

OK, no failures during the weekend testsuite runs.

I've created https://issues.apache.org/jira/browse/WSS-587 and here is the patch I've tried: https://github.com/asoldano/wss4j/commit/5a7897f7440940a11c0c853fbb8fb26c644fa898.diff

Colm, if that's fine with you I can go ahead and commit and/or send a PR.


Cheers
Alessio

On 16/09/2016 22:41, Alessio Soldano wrote:

OK, I have a patched wss4j 2.1.8-asoldano-SNAPSHOT on the snapshot repository and I'm letting the CI server here run with it for a few days. Let's see if the failures pop up or not...
Cheers
Alessio

On 15/09/2016 11:20, Alessio Soldano wrote:
Mmh... I need to build a patched wss4j snapshot and have it consumed by the remote machine that is reproducing the issue a bit more frequently (locally it's very rare). Will let you know :-)
On 15/09/2016 10:35, Colm O hEigeartaigh wrote:
Hi Alessio,

Yes, that makes sense to me. If you perform the fix locally, do the
intermittent failures go away?

Colm.

On Wed, Sep 14, 2016 at 9:55 PM, Alessio Soldano <[email protected]>
wrote:
Hi,
I'm currently seeing an intermittent issue in the JBossWS-CXF testsuite
(stacktrace at https://paste.fedoraproject.org/428145/14738847/raw/ ),
with the EHCacheTokenStore creation failing due to the CacheManager
having been shut down. The testsuite includes multiple tests; almost all
of them create JAX-WS clients, and most of them use the current thread
bus (a few of them create a new bus, set it as the default thread bus,
run and eventually shut down the bus). What I suspect is some kind of
concurrency issue in the CacheManager lifecycle management.

I've looked a bit at the code and noticed that there's basically a 1-1
relationship between Bus instances and CacheManager instances. Given that
I have some tests that do not explicitly shut down the bus (or the
client) after execution, it can happen that a client is closed because
the JDK eventually finalizes ClientProxy, which in the end causes the
CacheCleanupListener to close the token store and hence to
release/shut down the cache manager (see the invocation flow at
https://paste.fedoraproject.org/428150/47388530/raw/ ). Unfortunately
that exact cache manager could possibly be in use to serve another
client running in the same bus. AFAICS, there's an attempt to avoid
problems like this in WSS4J's EHCacheManagerHolder (which deals with CXF
requests for creating/releasing cache managers), as it has a
ConcurrentHashMap<String, AtomicInteger> attribute to keep track of how
many consumers of a given cache manager there are and avoid shutting
down a manager if it's still in use. Looking at its getCacheManager and
releaseCacheManager methods I can see a possible concurrency flaw which
could be the root of my failure. The releaseCacheManager method could be
called with cacheManager X as parameter while a different thread is
running getCacheManager and is just before line 106 (that is, just
before the AtomicInteger is retrieved from the map) with the local
cacheManager variable already resolved to X. That should later lead to
an attempt to use an already shut down cache manager. I would be tempted
to suggest making those two methods synchronized (the map could then
probably be a plain hash map).
WDYT? I might be missing something, so posting here before opening up a
JIRA. Any idea?
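
To make the suggestion concrete, here is a minimal sketch of a holder whose acquire and release methods are both synchronized, so the lookup, the counter update and the shutdown can never interleave. This is not the WSS4J code: RefCountedHolder, FakeManager, acquire and release are hypothetical names, and FakeManager is only a stand-in for the real cache manager.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Hypothetical sketch, NOT the actual EHCacheManagerHolder.
 * Both static methods lock on the same class monitor.
 */
public final class RefCountedHolder {

    /** Hypothetical stand-in for a cache manager; not the Ehcache API. */
    public static final class FakeManager {
        private final String name;
        private boolean shutdown;

        FakeManager(String name) { this.name = name; }

        public String getName() { return name; }
        public synchronized boolean isShutdown() { return shutdown; }
        synchronized void shutdown() { shutdown = true; }
    }

    // Plain HashMaps are enough because every access happens under the lock.
    private static final Map<String, FakeManager> MANAGERS = new HashMap<>();
    private static final Map<String, Integer> COUNTS = new HashMap<>();

    private RefCountedHolder() { }

    /** Looks up (or creates) the manager and bumps its counter atomically. */
    public static synchronized FakeManager acquire(String name) {
        FakeManager manager = MANAGERS.get(name);
        if (manager == null) {
            manager = new FakeManager(name);
            MANAGERS.put(name, manager);
        }
        COUNTS.merge(name, 1, Integer::sum);
        return manager;
    }

    /** Decrements the counter; shuts the manager down only on the last release. */
    public static synchronized void release(FakeManager manager) {
        String name = manager.getName();
        if (MANAGERS.get(name) != manager) {
            return; // stale or unknown instance: nothing to release
        }
        int remaining = COUNTS.get(name) - 1;
        if (remaining <= 0) {
            COUNTS.remove(name);
            MANAGERS.remove(name);
            manager.shutdown();
        } else {
            COUNTS.put(name, remaining);
        }
    }
}
```

With this shape, a thread inside acquire cannot be holding a reference to X while another thread completes release for X: either the acquire runs first and the counter stays above zero, or the release runs first and the acquire creates a fresh manager.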

Cheers

Alessio


--
Alessio Soldano
Web Service Lead, JBoss



