Re: RFC: 8229160: Reimplement JvmtiRawMonitor to use PlatformMonitor

David Holmes Tue, 10 Sep 2019 00:58:11 -0700

It turns out that polling for interrupts is actually very difficult todo correctly. There is an inherent race with timing out and beingsignalled that can result in lost signals. Trying to account for thatwithout introducing other problems would lead to a very complexsynchronization mechanism (which would need to track waiters,notifications and a "generation" count). The end result is that we'dbreak the tie to ObjectMonitor code, but would replace it with newcomplex untried code. :(


David


On 10/09/2019 8:49 am, David Holmes wrote:

Hi Serguei,

On 10/09/2019 4:26 am, [email protected] wrote:
Hi David,

On 9/8/19 19:15, David Holmes wrote:
Hi Dan,

On 7/09/2019 6:50 am, Daniel D. Daugherty wrote:
Hi David,

I've finally gotten back to this email thread...
Thanks.
FYI testing to date:
 - tiers 1 -3 all platforms
 - hotspot: serviceability/jvmti
                          /jdwp
            vmTestbase/nsk/jvmti
                          /jdwp
- JDK: com/sun/jdi
You should also add:

open/test/hotspot/jtreg/vmTestbase/nsk/jdb
open/test/hotspot/jtreg/vmTestbase/nsk/jdi
open/test/jdk/java/lang/instrument
Okay - in progress. Though I can't see any use of RawMonitors in anyof these tests.
I took a quick look through the preliminary webrev and I don't see
anything that worries me.
Thanks. I'll prepare a more polished webrev soon.
Re: Thread.interrupt() and raw_wait()

It would be good to see if that semantic is being tested via the
JCK test suite for JVM/TI.
It isn't. The only thing directly tested for RawMonitorWait is normalsuccessful operation and reporting "not owner" when not the owner. Nocheck for JVMTI_ERROR_INTERRUPT exists other than as input for theGetErrorName function.
This is most likely true.
My only concern is if RawMonitor's can be used in the JCK testlibraries (low probability).I've asked Leonid Kuskov (JCK) to double check this (added to themailing list).
There's only one test in the whole test base that checks for theinterrupt and that isvmTestbase/nsk/jvmti/RawMonitorWait/rawmnwait005/. In that test if weare not interrupted before the RawMonitorWait we will wait until thefull timeout elapses - which is 2 minutes by default - then returnand report the interrupt. Hence the test still passes. (If it was anuntimed wait that would be different of course).
I figured the same last Friday.
One more place to care about are NSK tests libraries that are locatedhere:
   test/hotspot/jtreg/vmTestbase/nsk/share

There are a couple of places where the RawMonitor is used:
jvmti/hotswap/HotSwap.cpp: if(!NSK_JVMTI_VERIFY(jvmti->RawMonitorWait(waitLock, millis))) jvmti/jvmti_tools.cpp: jvmtiError error =env->RawMonitorWait(monitor, millis);
The use in HotSwap.cpp is local.
The jvmti_tools.cpp defines this:
void rawMonitorWait(jvmtiEnv *env, jrawMonitorID monitor, jlongmillis) {
       jvmtiError error = env->RawMonitorWait(monitor, millis);
       exitOnError(error);
   }
which is used in the jvmti/agent_tools.cpp but does not depend oninterrupting of RawMonitor's (as I can see).
One more place to mention is:
   jvmti/DataDumpRequest/datadumpreq001/datadumpreq001.cpp

But I see no problems there as well.



The JDWP implementation is using RawMonitor's.
Please, see functions debugMonitorWait()/debugMonitorTimedWait() insrc/jdk.jdwp.agent/share/native/libjdwp/util.c.
It expects the JVMTI_ERROR_INTERRUPT but never makes a call to theJVMTI ThreadInterrupt().So, it looks like it does not depend on interrupting of RawMonitor'sin any way.
The more I try to convince people this change should be okay, themore uncomfortable I get with my own arguments. :) I think I'm goingto implement the polling approach for checking interrupts - say 500ms.
The JVMTI spec tells that the JVMTI_ERROR_INTERRUPT can be returnedfrom the RawMonitorWait:https://docs.oracle.com/en/java/javase/11/docs/specs/jvmti.html#RawMonitorWait
Yes it does and that is the only thing that implies a connection toThread.interrupt.
which means that RawMonitorWait can be interrupted with theThread.Interrupt()
or JVMTI InterruptThread():
https://docs.oracle.com/en/java/javase/11/docs/specs/jvmti.html#InterruptThread
That's one way to interpret the fact that RawMonitorWait can returnJVMTI_ERROR_INTERRUPT, but the actual interaction betweenThread.interrupt and RawMonitorWait is not explicitly stated. Arguablyyou can just check for interruption before and after the wait, to seewhether to return JVMTI_ERROR_INTERRUPT, without necessarily being ableto break out of the wait itself. That's been the whole premise of thischange proposal - that responsiveness to interrupts is more aquality-of-implementation issue.
But in any case I've decided to try the polling approach so that wewon't wait forever if interrupted but not notified.
Thanks,
David
-----
Thanks,
Serguei
I also very much like/appreciate the decoupling of JvmtiRawMonitors
from ObjectMonitors... Thanks for tackling this crazy task.
Thanks :)

David
Dan



On 8/15/19 2:22 AM, David Holmes wrote:
Bug: https://bugs.openjdk.java.net/browse/JDK-8229160
Preliminary webrev (still has rough edges):http://cr.openjdk.java.net/~dholmes/8229160/webrev.prelim/
Background:

We've had this comment for a long time:

 // The raw monitor subsystem is entirely distinct from normal
 // java-synchronization or jni-synchronization.  raw monitors are not
 // associated with objects.  They can be implemented in any manner
 // that makes sense.  The original implementors decided to piggy-back
// the raw-monitor implementation on the existing JavaobjectMonitor mechanism. // This flaw needs to fixed. We should reimplement raw monitorsas sui-generis. // Specifically, we should not implement raw monitors via javamonitors. // Time permitting, we should disentangle and deconvolve the twoimplementations // and move the resulting raw monitor implementation over to theJVMTI directories.
 // Ideally, the raw monitor implementation would be built on top of
 // park-unpark and nothing else.
This is an attempt to do that disentangling so that we can thenconsider changes to ObjectMonitor without having to worry aboutJvmtiRawMonitors. But rather than building on low-level park/unpark(which would require the same manual queue management and much ofthe same complex code as exists in ObjectMonitor) I decided to tryand do this on top of PlatformMonitor.
The reason this is just a RFC rather than RFR is that I overlookeda non-trivial aspect of JvmtiRawMonitors: like Java monitors (asimplemented by ObjectMonitor) they interact with theThread.interrupt mechanism. This is not clearly stated in the JVMTI specification [1] but only in passing by the possible errors forRawMonitorWait:
JVMTI_ERROR_INTERRUPT    Wait was interrupted, try again
As I explain in the bug report there is no way to build in properinterrupt support using PlatformMonitor as there is no way we can"interrupt" the low-level pthread_cond_wait. But we can approximateit. What I've done in this preliminary version is just checkinterrupt state before and after the actual "wait" but we won't getwoken by the interrupt once we have actually blocked. Alternativelywe could use a periodic polling approach and wakeup every Nms tocheck for interruption.
The only use of JvmtiRawMonitors in the JDK libraries (JDWP) is notaffected by this choice as that code ignores the interrupt untilthe real action it was waiting for has occurred. The interrupt isthen reposted later.
But more generally there could be users of JvmtiRawMonitors thatexpect/require that RawMonitorWait is responsive toThread.interrupt in a manner similar to Object.wait. And if any ofthem are reading this then I'd like to know - hence this RFC :)
FYI testing to date:
 - tiers 1 -3 all platforms
 - hotspot: serviceability/jvmti
                          /jdwp
            vmTestbase/nsk/jvmti
                          /jdwp
 - JDK: com/sun/jdi

Comments/opinions appreciated.

Thanks,
David
[1]https://docs.oracle.com/en/java/javase/11/docs/specs/jvmti.html#RawMonitorWait

Re: RFC: 8229160: Reimplement JvmtiRawMonitor to use PlatformMonitor

Reply via email to