Re: [webkit-dev] MessagePorts and garbage collection

Maciej Stachowiak Thu, 07 May 2009 11:25:17 -0700


On May 6, 2009, at 6:41 PM, Drew Wilson wrote:

Following up. I think I have my head around how Worker GC ishappening (I may start another thread about that, as it looks likethere's some cases where the thread won't be shut down, but thegeneral design is sound).
MessagePort GC is a little trickier, because we need to detect whenboth sides have no external references, based on this part of theHTML5 spec:[...] a message port can be received, given an event listener, andthen forgotten, and so long as that event listener could receive amessage, the channel will be maintained.
Of course, if this was to occur on both sides of the channel, thenboth ports would be garbage collected, since they would not bereachable from live code, despite having a strong reference to eachother.
From looking at the code in bindings/js, it looks like I've got twotools to manage object reachability:
1) I can tell when my object is reachable (during a GC) becausemark() will be invoked on it.2) I can force my object to stay active (as long as the owningcontext is active) by making it an ActiveDOMObject and returningtrue from hasPendingActivity() (which seems like it does nothing butinvoke mark() on the object).
So, #2 lets me keep an object alive, but to implement the spec, Ineed to be able to detect when my object has no more references,without actually having it get garbage collected. If I can do that,then I can build my own distributed state mechanism to allow me todetermine when it's safe to GC the objects.
I'm looking through the JSC::Collector code, and I didn't seeanything that did exactly what I want, but there are probably somethings that we could do with protect() to enable this. Has anyoneelse had to do anything like what I describe above? It's not exactlyeven a multi-thread issue, as it seems like this problem would occureven with just a single thread.

It is specifically a multi-thread issue, because with a single threadand single heap both MessagePorts could just mark() each other - ifthey have no other references, they will be collected anyway becauseGC will happily collect an unreferenced cycle.

It's only the separate per-thread heaps that make it challenging,since GC may occur at different times and on separate heaps, so thetwo MessagePorts have to protect each other in a persistent way untilboth become unreachable.

The best way I can think of to handle this is to have a special phaseafter normal marking where objects with an external/cross-threadreference get marked in a distinctive way. Then each MessagePort wouldknow if it was marked solely due to its opposite endpoint being live.I don't recall if there is a way for an unreachable MessagePort tobecome reachable - I think yes, because the message event listener canstuff the MessagePort in a global variable. But I think an unerachableport can only become reachable by receiving a message. Thus, you needa core data structure for the MessageChannel which detects the casethat there are no messages pending in either direct and both endpointsare alive only due to the other endpoint. Something like that. This isa very rough design sketch, Alexey can probably explain in more detailor I can study the code.

My impression is that Workers use a similar scheme with a specialadditional marking phase, or once did, but Alexey will recall betterthan I.


 - Maciej

-atw

2009/5/6 Drew Wilson <[email protected]>
Thanks, this puts me on the right track. I've had a bunch ofdiscussions with the Chrome folks on how we'd track MessagePortreachability in Chrome, but I'd hoped that the problem might besimpler in WebKit since we had direct access to the data structurescross-thread. The existence of separate GC heaps means it's notparticularly simpler after all.
-atw

2009/5/6 Maciej Stachowiak <[email protected]>


On May 6, 2009, at 1:53 PM, Drew Wilson wrote:
OK, that's good to know (it only supports document contexts) -clearly some work has been done to prepare for multi-thread usage(for example, the core data structure is a thread-safe MessageQueue).
I'm quite happy to drive this design (in fact, I'm in the middle ofthis now) but I would like to make sure I understand in generalwhat the correct approach is for managing GC-able objects that areaccessed cross-thread - I haven't been able to find anydocumentation (outside of the code itself).
Is the right approach to use JSLock when manipulating cross-threadlinkage? I'll write up a quick document to describe the approachI'm taking, but I'd like to understand your concerns aboutdeadlocks. So long as we have only a single shared per-channelmutex, and we never grab any other locks (like JSLock) aftergrabbing that mutex, we should be OK. Are there other locks thatmay be grabbed behind the scenes that I should be aware of?
JSLock is not the right approach. Workers have their own completelyseparate GC heap. JSLock only locks the current context group'sheap. It will not prevent collection in other heaps.
I don't know exactly what the right approach is. Ultimately it's adistributed GC problem, both for our split-heap multithreading andfor an approach that used processes for workers. And distributed GCis hard.
However, Worker itself has a similar issue, since it can be keptalive either from the inside or the outside reference. You couldlook at how that problem was solved.
 - Maciej
-atw

2009/5/6 Alexey Proskuryakov <[email protected]>

06.05.2009, в 21:38, Drew Wilson написал(а):
It looks like the JSC collection code relies on JSLock to lock theheap - I'm guessing that I'll need to explicitly grab the JSLockwhenever I'm manipulating the linkage between the two ports, isthat correct? Or is there a different/better way to handlesituations like this?
The JavaScriptCore implementation of MessagePorts only supportsdocument contexts (i.e., it only works on main thread).
As mentioned earlier, the first thing needed to implementMessagePorts in workers is a design of how they can be passedaround without breaking GC. It is likely that taking a lockwhenever atomicity is desired will cause deadlocks.
- WBR, Alexey Proskuryakov



_______________________________________________
webkit-dev mailing list
[email protected]
http://lists.webkit.org/mailman/listinfo.cgi/webkit-dev

_______________________________________________
webkit-dev mailing list
[email protected]
http://lists.webkit.org/mailman/listinfo.cgi/webkit-dev

Re: [webkit-dev] MessagePorts and garbage collection

Reply via email to