On 05/04/2013 01:42 AM, Vitaly Davidovich wrote:

Personally, I think I'd exit the VM in this case. The odds of hitting OOM while allocating TIE and having it be just a very unfortunate transient condition are quite low; most likely, the VM is going to have lots of trouble elsewhere anyway.


I thought the purpose of fixing this bug was exactly to support un-terminated reference processing in situations where OOME *is* a transient condition, and not to take every opportunity to exit the VM when OOME is encountered.

Also, by swallowing the OOM there and continuing makes an assumption that the lock is still in valid/working state; that may be the case today, but I don't know if that's a safe assumption generally.


I think It would be a JVM bug if it wasn't so. The construction and propagation of InterruptedException should be and is attempted after the ownership of the object monitor is re-obtained.

Besides, no OOME will be thrown if the ReferenceHandler thread isn't interrupted while there is heap memory shortage, but VM is equally "in trouble elsewhere" nevertheless. So I don't think it's ReferenceHandler's call to decide when to terminate the VM. It can continue with it's purpose unaffected...

Another thing to be considered is what happens when the ReferenceHandler thread is stop()-ed while it is executing code inside wait(). In this case a ThreadDeath is thrown which should not be caught and ignored. But as it appears, the ThreadDeath error object is constructed by the thread executing the Thread.stop() method so the OOME can only be thrown in that thread and not in the thread being stopped...

To back this claims, I have written the following test:

public class Test extends Thread {

    final Object lock = new Object();

    public Test() {
        super("test");
    }

    static void log(String msg) {
        System.err.println(
            Thread.currentThread().getName() +
" @ " + new SimpleDateFormat("hh:mm:ss.SSS").format(new Date()) +
            ": " + msg
        );
    }

    @Override
    public void run() {
        while (true) {
            synchronized (lock) {
                try {
                    log("waiting");
                    lock.wait();
                }
                catch (InterruptedException ie) {
                    log("interrupted");
                    ie.printStackTrace();
                }
                catch (ThreadDeath td) {
                    log("stopped");
                    td.printStackTrace();
                    throw td;
                }
            }
        }
    }

    public static void main(String[] args) throws Exception {
        Test test = new Test();
        test.start();

        Thread.sleep(1000L);
        synchronized (test.lock) {
            log("interrupting");
            test.interrupt();
            Thread.sleep(1000L);
        }
        log("exited synchronized block 1");

        Thread.sleep(1000L);
        synchronized (test.lock) {
            log("stopping");
            test.stop();
            Thread.sleep(1000L);
        }
        log("exited synchronized block 2");

        test.join();
    }
}


Which produces the following output:

test @ 09:57:53.998: waiting
main @ 09:57:54.974: interrupting
main @ 09:57:55.975: exited synchronized block 1
test @ 09:57:55.975: interrupted
java.lang.InterruptedException: Constructed by test @ 09:57:55.975
    at java.lang.Object.wait(Native Method)
    at java.lang.Object.wait(Object.java:502)
    at test.Test.run(Test.java:36)
test @ 09:57:55.977: waiting
main @ 09:57:56.975: stopping
main @ 09:57:57.976: exited synchronized block 2
test @ 09:57:57.976: stopped
java.lang.ThreadDeath: Constructed by main @ 09:57:56.976
    at java.lang.Thread.stop(Thread.java:815)
    at test.Test.main(Test.java:66)


I also modified the InterruptedException and ThreadDeath no-arg constructors to record the thread name and time of construction to get this output:

    public InterruptedException() {
        super("Constructed by " + Thread.currentThread().getName() +
" @ " + new SimpleDateFormat("hh:mm:ss.SSS").format(new Date()));
    }


Regards, Peter

Sent from my phone

On May 3, 2013 3:26 PM, "Peter Levart" <peter.lev...@gmail.com <mailto:peter.lev...@gmail.com>> wrote:


    On 05/03/2013 07:47 PM, Thomas Schatzl wrote:

        Hi,

            Hi Tomas,

            I don't know if this is the case here, but what if the
            ReferenceHandler thread is interrupted while wait()-ing
            and the
            construction of InterruptedException triggers OOME?

        I am sure this is the case - previously I thought
        InterruptedException
        is a preallocated exception like others.
        ObjectMonitor::wait() may throw it, by creating new
        InterruptedException
        instances.

        Thanks!

        Now that we've found the very likely cause, what to do about it?


    Maybe just ignore it since if it happens during wait(), the cause
    is supposed to be interrupted thread and the InterruptedException
    that was to be thrown would be ignored too:

                            try {
                                lock.wait();
                            } catch (InterruptedException |
    OutOfMemoryError x) { }

    Regards, Peter

        The current state of silently crashing the reference handler
        thread is
        unsatisfying imo as it leads to very hard to find problems.

        The options I see all involve catching this (or any other OOME
        caused by
        other means like the test program) and either recovering as
        much as
        possible or exiting the VM (like in the sun.misc.Cleaner
        handling).

        Any other suggestions?

        Thanks,
        Thomas



Reply via email to