Re: RFR (M) 8212160: JVMTI agent crashes with "assert(_value != 0LL) failed: resolving NULL _value"

coleen . phillimore Tue, 03 Dec 2019 05:08:38 -0800



On 12/2/19 11:52 PM, David Holmes wrote:

Hi Coleen,

On 3/12/2019 12:43 am, [email protected] wrote:
On 11/26/19 7:03 PM, David Holmes wrote:
(adding runtime as well)

Hi Coleen,

On 27/11/2019 12:22 am, [email protected] wrote:
Summary: Add local deferred event list to thread to post eventsoutside CodeCache_lock.
This patch builds on the patch for JDK-8173361. With this patch, Imade the JvmtiDeferredEventQueue an instance class (not AllStatic)and have one per thread. The CodeBlob event that used to drop theCodeCache_lock and raced with the sweeper thread, adds the eventsit wants to post to its thread local list, and processes it outsidethe lock. The list is walked in GC and by the sweeper to keep thenmethods from being unloaded and zombied, respectively.
Sorry I don't understand why we would want/need a deferred eventqueue for every JavaThread? Isn't this only relevant fornon-JavaThreads that need to have the ServiceThread process thedeferred event?
I thought I'd written this in the bug but I had only discussed thiswith Erik. I've added a comment to the bug to explain why I addedthe per-JavaThread queue. In order to process these events after theCodeCache_lock is dropped, I have to queue them somewhere safe. TheServiceThread queue is safe, *but* the ServiceThread can't keep upwith the events, especially from this test case. So the test casegets a native OOM.
So I've added the safe queue as a field to each JavaThread becausemultiple JavaThreads could be posting these events at the same time,and there didn't seem to be a better safe place to cache them,without adding another layer of queuing code.
I think I'm getting the picture now. At the time the events aregenerated we can't post them directly because the current thread isinside compiler code. Hence the events must be deferred. Using theServiceThread to handle the deferred events is one way to deal withthis - but it can't keep up in this scenario. So instead we store theevents in the current thread and when the current thread returns tocode where it is safe to post the events, it does so itself. Is thatgenerally correct?


Yes.

I admit I'm not keen on adding this additional field per-thread justfor a temporary usage. Some kind of stack allocated helper would bepreferable, but would need to be passed through the call chain so thatthe events could be added to it.

Right, and the GC and nmethods_do has to find it somehow. It wasn't myfirst choice of where to put it also because there is too many things inJavaThread. Might be time for a future cleanup of Thread.

Also I'm not clear why we aggressively delete the _jvmti_event_queueafter posting the events. I'd be worried about the overhead we areintroducing for creating and deleting this queue. When theJvmtiDeferredEventQueue data structure was intended only for use bythe ServiceThread its dynamic node allocation may have made moresense. But now that seems like a liability to me - ifJvmtiDeferredEvents could be linked directly we wouldn't need dynamicnodes, nor dynamic per-thread queues (just a per-thread pointer).

I'm not following. The queue is for multiple events that might beposted while in the CodeCache_lock, so they need to be in order andlinked together. While we post them and take them off, if the callbacksafepoints (maybe calls back into the JVM), we don't want to have GC ornmethods_do walk the one that's been posted already. So a queue seems tomake sense.

One thing that I experimented with was to have the ServiceThread takeownership of the queue in it's local thread queue and post them all,which could be a future enhancement. It didn't help my OOM situation.

Deleting the queue after all the events are posted allowsJavaThread::oops_do and nmethods_do only a null check to deal with thisjvmti wart.


Thanks,
Coleen

Just some thoughts.

Thanks,
David
I did write comments to this effect here:
http://cr.openjdk.java.net/~coleenp/2019/8212160.01/webrev/src/hotspot/share/prims/jvmtiCodeBlobEvents.cpp.udiff.html
Thanks,
Coleen
David
Also, the jmethod_id field in nmethod was only used as a boolean sodon't create a jmethod_id until needed forpost_compiled_method_unload.
Ran hs tier1-8 on linux-x64-debug and the stress test that crashedin the original bug report.
open webrev athttp://cr.openjdk.java.net/~coleenp/2019/8212160.01/webrev
bug link https://bugs.openjdk.java.net/browse/JDK-8212160

Thanks,
Coleen

Re: RFR (M) 8212160: JVMTI agent crashes with "assert(_value != 0LL) failed: resolving NULL _value"

Reply via email to