OOMEInReferenceHandler.java fails intermittently

Mandy Chung Mon, 23 Dec 2013 15:10:08 -0800

On 12/23/2013 2:02 PM, srikalyan chandrashekar wrote:

Hi Mandy, after some trials i could simulate the failure again (nowwith UEH in place), however the UEH now cannot print enough details asit also tries to allocate memory, when it does Thread.getName()(itinternally creates a String object), printStackTrace() also createsnew WrappedPrintStream object. See the following trace

That's what I later also thought that may run into after suggesting UEHand no object can be allocated at this point.

It worths trying Peter's suggestion to override the modified version ofReference class with instrumentation and see what you will get.


Mandy

Exception: java.lang.OutOfMemoryError thrown from theUncaughtExceptionHandler in thread "Reference Handler"
ERROR: java.lang.Exception: Reference Handler thread died.
        at OOMEInReferenceHandler.main(OOMEInReferenceHandler.java:105)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
atsun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)atsun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:483)
atcom.sun.javatest.regtest.MainWrapper$MainThread.run(MainWrapper.java:94)
        at java.lang.Thread.run(Thread.java:744)
Meanwhile i am trying looking around to actually print somethinguseful without allocating any new memory.
---
Thanks
kalyan

On 12/20/2013 01:00 PM, srikalyan wrote:
Hi Mandy, yes I ran with JTreg to simulate the failure, i will trythe UEH patch to see if it sheds some light and get back to you.Thanks for the direction :)
--
Thanks
kalyan
Ph: (408)-585-8040


On 12/19/13, 8:33 PM, Mandy Chung wrote:
Hi Srikalyan,

Maybe you can get add an uncaught handler to see if you can get
any information.  I ran it for 1000 times but not able to duplicate
the failure.  Did you run it with jtreg (I didn't)?

Below is the patch to install a thread's uncaught handler that
you can take and try.
diff --git a/test/java/lang/ref/OOMEInReferenceHandler.javab/test/java/lang/ref/OOMEInReferenceHand
ler.java
--- a/test/java/lang/ref/OOMEInReferenceHandler.java
+++ b/test/java/lang/ref/OOMEInReferenceHandler.java
@@ -51,6 +51,14 @@
          return first;
      }

+     static class UEH implements Thread.UncaughtExceptionHandler {
+         public void uncaughtException(Thread t, Throwable e) {
+ System.err.println("ERROR: " + t.getName() + "exception " +
+                 e.getMessage());
+             e.printStackTrace();
+         }
+     }
+
      public static void main(String[] args) throws Exception {
// preinitialize the InterruptedException class so thatthe reference handler// does not die due to OOME when loading the class if itis the first use
@@ -77,6 +85,8 @@
throw new IllegalStateException("Couldn't findReference Handler thread.");
          }

+ referenceHandlerThread.setUncaughtExceptionHandler(new UEH());
+
          ReferenceQueue<Object> refQueue = new ReferenceQueue<>();
          Object referent = new Object();
WeakReference<Object> weakRef = newWeakReference<>(referent, refQueue);
On 12/19/2013 6:57 PM, srikalyan chandrashekar wrote:
Hi David Thanks for your comments, the unguarded part(clean andenqueue) in the Reference Handler thread does not seem to createany new objects, so it is the application(the test in this case)which is adding objects to heap and causing the Reference Handlerto die with OOME. I am still unsure about the side effects of thecode change and agree with your thoughts(on memory exhaustiontest's reliability).
PS: hotspot dev alias removed from CC.

--
Thanks
kalyan

On 12/19/13 5:08 PM, David Holmes wrote:
Hi Kalyan,
This is not a hotspot issue so I'm moving this to core-libs,please drop hotspot from any replies.
On 20/12/2013 6:26 AM, srikalyan wrote:
Hi all,  I have been working on the bug JDK-8022321
<https://bugs.openjdk.java.net/browse/JDK-8022321> , this is asporadic
failure and the webrev is available here
http://cr.openjdk.java.net/~srikchan/Regression/JDK-8022321_OOMEInReferenceHandler-webrev/
I'm really not sure what to make of this. We have a test thattriggers an out-of-memory condition but the OOME can actually turnup in the ReferenceHandler thread causing it to terminate and thetest to fail. We previously accounted for the non-obviousoccurrences of OOME due to the Object.wait and the possible needto load the InterruptedException class - but still the OOME canappear where we don't want it. So finally you have just placed thewhole for(;;) loop in a try/catch(OOME) that ignores the OOME. I'mcertain that makes the test happy, but I'm not sure it is reallywhat we want for the ReferenceHandler thread. If the OOME occurswhile cleaning, or enqueuing then we will fail to clean and/orenqueue but there would be no indication that has occurred and Ithink that is a bigger problem than this test failing.
There may be no way to make this test 100% reliable. In fact I'dsuggest that no memory exhaustion test can be 100% reliable.
David
*
**"Root Cause:Still not known"*
2 places where there is a possibility for OOME
1) Cleaner.clean()
2) ReferenceQueue.enqueue()
1) The cleanup code in turn has 2 places where there ispotential for
throwing OOME,
a) thunk Thread which is run from clean() method. ThisRunnable is
passed to Cleaner and appears in the following classes
         java/nio/DirectByteBuffer.java
         sun/misc/Perf.java
         sun/nio/fs/NativeBuffer.java
         sun/nio/ch/IOVecWrapper.java
         sun/misc/Cleaner/ExitOnThrow.java
However none of the above overridden implementations ever create an
object in the clean() code.
     b) new PrivilegedAction created in try catch Exception block of
clean() method but for this object to be created and to be held
responsible for OOME an Exception(other than OOME) has to be thrown.
2) No new heap objects are created in the enqueue method noranywhere in
the deep call stack (VM.addFinalRefCount() etc) so this cannot be a
potential cause.

*Experimental change to java.lang.Reference.java* :
- Put one more guard (try catch with OOME block) in the Reference
Handler Thread which may give the Reference Handler a chance tocleanup.
This is fixing the test failure (several 1000 runs with 0 failures)
- Without the above change the test fails atleast 3-5 times forevery
1000 run.
*PS*: The code change is to a very critical part of JDK and i amfullynot aware of the consequences of the change, hence seeking experthelp
here. Appreciate your time and inputs towards this.

Re: Analysis on JDK-8022321 java/lang/ref/OOMEInReferenceHandler.java fails intermittently

Reply via email to