Hi Ard,

I've not tried the double-aggregate thing (yet), but I've now attached to the bug a very simple repeatable test that demonstrates the lock up as I've experienced it.

Have fun!

Ellis.


Ard Schrijvers wrote:

Hi,

The crux is that the sub-pipeline is called twice within the context of the master pipeline (once by the root pipeline, once by an include); so the pipeline keys that collide are those of the sub-pipeline, not of the master pipeline.

My 'broken' pipeline is too complex to explain, but it's basically something like:

If this results in a deadlock, then something is basically wrong with this locking. Does the master pipeline lock its sub-pipeline until it is itself finished? That wouldn't make sense. I mean, for the example below, the master would hold a lock related to a cache key something like:
PK_G-file-cocoon:/foo?pipelinehash=-3411761154931530775_T-xslt-/....page.xsl_S-xml-1

Then, as I understand it, this key is locked. Now, the pipeline with pattern="foo" gets its own pipeline cache key, which is also locked. But after this one is finished, your problem indicates that the lock of this sub-pipeline is not cleared until the master pipeline is finished? This doesn't make sense to me.

Furthermore, if this setup gives problems, then wouldn't

<map:aggregate>
  <map:part src="cocoon:/foo"/>
  <map:part src="cocoon:/foo"/>
</map:aggregate>

result in the same deadlock? I must be missing something trivial.

Ard

<map:match pattern="master">
 <map:generate src="cocoon:/foo"/>
 <map:transform src="page.xsl"/>   <!-- generates the include element for "cocoon:/included" -->
 <map:transform type="include"/>   <!-- includes the "included" sub-pipeline -->
 <map:serialize/>
</map:match>

<map:match pattern="included">
 <map:generate src="cocoon:/foo"/>
 <map:transform src="included-page.xsl"/>
 <map:serialize/>
</map:match>

<map:match pattern="foo"> <!-- this gets called twice -->
 <map:generate ... />
 <map:serialize/>
</map:match>

Ellis.


Ard Schrijvers wrote:

Hello,



Cocoon 2.1.9 introduced the concept of a lock in AbstractCachingProcessingPipeline, an optimization to prevent two concurrent requests from generating the same cached content. The first request adds the pipeline key to the transient cache to 'lock' the cache entry for that pipeline; subsequent concurrent requests wait for the first request to cache the content (by waiting on the pipeline key's lock entry) before proceeding, and can then use the newly cached content.
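In rough pseudo-Java the pattern is something like this (a simplified sketch only, not the actual AbstractCachingProcessingPipeline code; the map and method names here are made up):

// Simplified illustration of the locking idea; not the actual Cocoon code,
// and the names (PIPELINE_LOCKS, generateAndCache, readFromCache) are made up.
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

class PipelineLockSketch {

    // stands in for the transient store holding one lock entry per pipeline key
    static final Map<String, Object> PIPELINE_LOCKS =
            new ConcurrentHashMap<String, Object>();

    byte[] getContent(String pipelineKey) throws InterruptedException {
        Object newLock = new Object();
        Object existing = PIPELINE_LOCKS.putIfAbsent(pipelineKey, newLock);

        if (existing == null) {
            // first request: we hold the 'lock' entry, so generate and cache the content
            try {
                return generateAndCache(pipelineKey);
            } finally {
                PIPELINE_LOCKS.remove(pipelineKey);
                synchronized (newLock) {
                    newLock.notifyAll();          // wake any waiting requests
                }
            }
        }

        // concurrent request: wait until the first request has cached the content
        synchronized (existing) {
            while (PIPELINE_LOCKS.get(pipelineKey) == existing) {
                existing.wait();
            }
        }
        return readFromCache(pipelineKey);
    }

    byte[] generateAndCache(String key) { /* run the pipeline, store the result */ return new byte[0]; }
    byte[] readFromCache(String key)    { /* return the freshly cached result   */ return new byte[0]; }
}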

However, this has introduced an incompatibility with the IncludeTransformer: if the inclusions access the same yet-to-be-cached content as the root pipeline, the whole assembly hangs, since the thread ends up waiting on a lock that it itself already holds and that can therefore never be released.

e.g.
i) the root pipeline generates using the sub-pipeline cocoon:/foo.xml;
ii) the cocoon:/foo.xml sub-pipeline adds its pipeline key to the transient store as a lock;
iii) subsequently in the root pipeline, the IncludeTransformer is run;
iv) one of the inclusions also generates with cocoon:/foo.xml; this sub-pipeline blocks in AbstractProcessingPipeline.waitForLock() because the sub-pipeline key is already present;
v) deadlock.
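Stripped of all the pipeline machinery, the hang is simply a thread waiting on a condition that only it could ever satisfy; a toy illustration (not Cocoon code, and it hangs on purpose):

// Toy demonstration of the self-deadlock: the thread that 'locked' the key
// later waits for the key to be released, but it is the only thread that
// could ever release it.
import java.util.HashMap;
import java.util.Map;

class SelfDeadlockDemo {
    static final Map<String, Object> LOCKS = new HashMap<String, Object>();

    public static void main(String[] args) throws InterruptedException {
        String key = "cocoon:/foo.xml";

        // step ii): the sub-pipeline key is added as a lock by the root request
        Object lock = new Object();
        LOCKS.put(key, lock);

        // step iv): the include, running on the SAME thread, finds the key locked...
        synchronized (lock) {
            while (LOCKS.containsKey(key)) {
                lock.wait();   // ...and waits forever: nobody else will remove/notify it
            }
        }
        // step v): never reached
    }
}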
I do not understand one part of it. If a sub-pipeline, cocoon:/foo.xml, is called, a lock is generated for this sub-pipeline separately, right? (If not, I do not understand why it is not like this. I suppose a lock is generated for the root pipeline, but also for every sub-pipeline individually. I only suppose, though, because I did not actually look at the code.)
Now, if the include transformer calls this same sub-pipeline, which has its own lock, I do not see why a deadlock can occur. The root pipeline is locked, and the sub-pipeline is locked as well. The include transformer wants to include the same sub-pipeline, waits until that one is finished, and can then include it, right?
I must be missing something,
Regards Ard



I've found a (partial, see below) solution for this: instead of a plain Object being added to the transient store as the lock object, Thread.currentThread() is added; when waitForLock() is called and the lock object exists, it checks that it is not the same thread before attempting to lock it; if it is the same thread, waitForLock() returns success, which allows generation to proceed. You lose the efficiency of generating the cache only once in this case, but at least it doesn't hang! With JDK 1.5 this could be made neater by using Thread#holdsLock() instead of adding the thread object itself to the transient store.
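In outline the change amounts to something like the following (a sketch of the idea only; see the attached patch for the actual code):

// Sketch of the same-thread check; the Map here just stands in for the
// transient store, and the method names/signatures may differ from the
// real AbstractCachingProcessingPipeline code.
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

class SameThreadLockSketch {

    // stand-in for the transient store; the fix stores Thread.currentThread()
    // as the lock object instead of a plain new Object()
    final Map<Object, Object> transientStore = new ConcurrentHashMap<Object, Object>();

    boolean generateLock(Object key) {
        return transientStore.putIfAbsent(key, Thread.currentThread()) == null;
    }

    boolean waitForLock(Object key) {
        Object lock = transientStore.get(key);
        if (lock == null) {
            return true;                   // nothing is generating this key, proceed
        }
        if (lock == Thread.currentThread()) {
            // the lock was taken earlier by this very thread (e.g. by the root
            // pipeline); waiting on it would deadlock, so proceed and regenerate
            return true;
        }
        synchronized (lock) {
            try {
                lock.wait();               // wait for the other request to finish caching
            } catch (InterruptedException e) {
                return false;
            }
        }
        return true;
    }

    void releaseLock(Object key) {
        Object lock = transientStore.remove(key);
        if (lock != null) {
            synchronized (lock) {
                lock.notifyAll();          // wake up anyone blocked in waitForLock()
            }
        }
    }
}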

See patch file.

However, even with this fix, parallel includes (when enabled) may still hang: they pass the not-the-same-thread test, but then block because the root pipeline, which holds the initial lock, cannot complete (and therefore satisfy the lock condition for the parallel threads) before those threads themselves have completed, which results in a deadlock again.

The complete solution is probably to avoid locking if the lock is held by the same top-level Request, but that requires more knowledge of Cocoon's processing than I (currently) have!
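If it were possible, it might look roughly like this (very much a sketch; I'm assuming the pipeline can reach the current Request via the object model, the attribute name is made up, and whether parallel include threads would actually see the same Request attributes is exactly the part I'm unsure about):

// Sketch only: tie the lock to the top-level Request rather than to the Thread.
// The way the Request is obtained here (ObjectModelHelper.getRequest()) and the
// attribute name are assumptions; adapt to what the pipeline actually has.
import java.util.Map;
import org.apache.cocoon.environment.ObjectModelHelper;
import org.apache.cocoon.environment.Request;

class RequestScopedLockSketch {

    private static final String LOCK_OWNER_ATTR = "caching-pipeline.lock.owner"; // hypothetical

    /** What to store in the transient store: one token per top-level request. */
    Object lockTokenFor(Map objectModel) {
        Request request = ObjectModelHelper.getRequest(objectModel);
        Object owner = request.getAttribute(LOCK_OWNER_ATTR);
        if (owner == null) {
            owner = new Object();
            request.setAttribute(LOCK_OWNER_ATTR, owner);
        }
        return owner;
    }

    /** In waitForLock(): skip waiting when the lock belongs to our own request,
        covering both the same-thread include and (hopefully) parallel includes. */
    boolean heldByThisRequest(Object lock, Map objectModel) {
        Request request = ObjectModelHelper.getRequest(objectModel);
        return lock == request.getAttribute(LOCK_OWNER_ATTR);
    }
}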

IMHO, unless a complete solution to this is found, this optimization should be removed completely, or else made optional by configuration, since it renders the IncludeTransformer dangerous.




