Re: RFR(L): 8218628: Add detailed message to NullPointerException describing what is null.

Mandy Chung Tue, 26 Mar 2019 14:47:59 -0700



On 3/26/19 4:57 AM, Lindenmaier, Goetz wrote:

  and also the cases when it requires to look at its caller frame.

Never.  Only the method that was executed when the exception is
thrown is needed. I only mention the backtrace because it happens to
store the top method and the corresponding bytecode position in the
top method.


That explains why I don't see an example of:
     getNull().m();

     public void m() {};

     static NPE getNull() {
         return null;
     }

To a compiler expert, or to someone who has studied the data flow analysis paper, is it clear that this example won't be supported?

This is related to my comment about where the data flow analysis comes from. I was looking for some level of information to be spelled out in the JEP that can give a non-compiler expert a sufficient idea, but of course there is no point in replicating the paper here.
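The example above can be made runnable as a small sketch. The class name `NullReceiverDemo` and the helper `trigger` are hypothetical; the sketch only shows that the null value originates in a callee that has already returned, while the top frame of the exception is the method executing the failing call. Whether any message is attached depends on the JVM in use.

```java
// Hypothetical class name; mirrors the getNull().m() example above.
class NullReceiverDemo {
    void m() {}

    // Returns null, so the receiver of m() is the return value of a call.
    static NullReceiverDemo getNull() {
        return null;
    }

    // Runs the failing expression and returns the exception raised by the JVM.
    static NullPointerException trigger() {
        try {
            getNull().m();
            return null; // not reached: the call above always throws
        } catch (NullPointerException e) {
            return e;
        }
    }

    public static void main(String[] args) {
        NullPointerException e = trigger();
        StackTraceElement top = e.getStackTrace()[0];
        // The top frame is the method that executed the failing invocation.
        System.out.println("top frame method: " + top.getMethodName());
        // May be null; whether a message is present depends on the JVM.
        System.out.println("message: " + e.getMessage());
    }
}
```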

It's good to know that this proposal needs the top frame only. Please make that clear in the JEP. The JEP can take out `StackWalker` and instead talk about `StackFrame`, since doing it in Java means that it only needs to get back a `StackFrame` instance representing the top frame stored in `Throwable::backtrace`.

This will avoid the confusion about calling StackWalker in the NPE constructor and constructing many StackFrame and many ResolvedMethodName objects.  That was part
of the code review discussion.
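As a rough illustration of what "a single `StackFrame` for the top frame" looks like with today's public API, the sketch below fetches only the first frame of the live stack. This is an assumption-laden stand-in: it walks the current thread's stack rather than `Throwable::backtrace`, which has no public Java accessor, and the class name `TopFrameDemo` is hypothetical.

```java
import java.lang.StackWalker.StackFrame;

class TopFrameDemo {
    // Returns only the top frame of the current thread's stack.
    // findFirst() ends the walk after one frame, so the stream does not
    // need to produce StackFrame objects for the rest of the stack.
    static StackFrame topFrame() {
        return StackWalker.getInstance()
                .walk(frames -> frames.findFirst())
                .orElseThrow(IllegalStateException::new);
    }

    public static void main(String[] args) {
        StackFrame f = topFrame();
        // Method and bytecode index are the two pieces of information
        // the discussion above says are needed from the top frame.
        System.out.println(f.getClassName() + "." + f.getMethodName()
                + " @ bci " + f.getByteCodeIndex());
    }
}
```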

"Computing either part of these messages can fail due to insufficient
information."
What are the cases that it fails to obtain the information? It'd be useful
to list some examples if not all.

(a ? b : c).d
You can not know whether the ?-expression yielded b or c, thus
you don't know why the access .d failed.  There is a corresponding example
in [1].

Please describe in the JEP the known cases that this proposal doesn't cover. The conditional expression is one; a null receiver that is the return value from a caller frame is another (I think this would help the readers).
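The conditional-expression case can be sketched as follows. The names `ConditionalDemo`, `Box`, `read` and `throwsNpe` are hypothetical. The same field access fails regardless of which operand was null, which is why the failing access alone does not say whether `b` or `c` was the culprit.

```java
class ConditionalDemo {
    static class Box {
        int d = 1;
    }

    // The field access happens on whichever operand the condition selects.
    static int read(boolean a, Box b, Box c) {
        return (a ? b : c).d;
    }

    // Returns true if read(...) threw a NullPointerException.
    static boolean throwsNpe(boolean a, Box b, Box c) {
        try {
            read(a, b, c);
            return false;
        } catch (NullPointerException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        // Both calls fail at the same field access, for different reasons.
        System.out.println(throwsNpe(true, null, new Box()));
        System.out.println(throwsNpe(false, new Box(), null));
    }
}
```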

The "Different bytecode variants of a method" section

This section raises the case when a method representing an execution stack
frame throwing NPE becomes obsolete, it does not have the information
to compute the message lazily.  So this falls into the "insufficient
information" category and no improved NPE message can be generated.

Yes, note this also only addresses the method in which the exception was raised.

This should also go to the list of known cases that this proposal does not cover.

The "computation overhead" section

I agree with the requirement to have minimal performance overhead to
the NPE construction cost.

"NullPointerExceptions are thrown frequently."
"Fortunately, most Exceptions are discarded without looking at the message."

This is interesting.  Do you have any statistics on the number of NPEs thrown
and swallowed in certain applications/environments that you have gathered?

No, I can try to generate some numbers, though.  But this assumption was made by
several others that commented on the previous review thread. For Throwable
in general, it is also the assumption why the intermediate backtrace data
structure is implemented instead of generating the stackFrames right away.

Since the JEP quotes that NullPointerExceptions are thrown frequently
and swallowed, it would help if you can generate some numbers.

This JEP will help the discussion of the proposed feature and design, and avoid any confusion/assumption about the implementation.

"If the message is printed based on the backtrace, it must be omitted
  if the top frame was dropped from the backtrace."

If NPE includes the hidden frames, `Throwable::getStackTrace`,
`Throwable::toString` and the VM printing of the NPE stack trace need to
filter the hidden frames.  I think it's appropriate for this JEP to
cover that rather than a separate JBS issue.

Hmm, I don't completely understand what you want to say, please
excuse, I'm a foreign speaker :). I'll give a comprehensive answer:
To my understanding this is a consequence of the existing JVM/class file
implementation that calls methods that should be hidden to the user.
The implementation drops these frames when the backtrace datastructure
is computed. If this happens, the information needed to compute the
message is lost, and it can not be printed.


Yes I am familiar with that.

The fact that this happened must be remembered in the VM
so that printing the message can be suppressed. Else a message for
a completely wrong bytecode is printed.


Yup.

I think the JEP should mention that this is happening.  How this is
implemented and then pushed to the repo is not subject to a JEP,
is it?


I think this is related to this JEP.    The question would be:
- should it include all hidden frames?

- should it include the top frame if it's a hidden frame, since this feature only requires the top frame?

This JEP requires the hidden frame to be recorded in the backtrace. This has an impact on the printing of the stack trace, which needs to know about hidden frames and decide whether they should be filtered or included when printing.
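As an analogy only, the public `StackWalker` API already makes the "filter or include" decision explicit through `StackWalker.Option.SHOW_HIDDEN_FRAMES`. The sketch below counts frames from inside a lambda with and without that option. It does not touch `Throwable::backtrace`; the class name `HiddenFrameDemo` is hypothetical, and whether a hidden frame is actually present for a lambda call is JVM-dependent.

```java
import java.util.function.Supplier;

class HiddenFrameDemo {
    // Counts the frames a StackWalker reports from inside a lambda body.
    static long countFrames(boolean showHidden) {
        StackWalker walker = showHidden
                ? StackWalker.getInstance(StackWalker.Option.SHOW_HIDDEN_FRAMES)
                : StackWalker.getInstance();
        // Calling through a lambda gives the JVM a chance to insert
        // implementation frames that are hidden by default.
        Supplier<Long> body = () -> walker.walk(frames -> frames.count());
        return body.get();
    }

    public static void main(String[] args) {
        System.out.println("default view:       " + countFrames(false));
        System.out.println("with hidden frames: " + countFrames(true));
    }
}
```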

I made a webrev of its own for this problem, so that the code
can be looked at isolated of other, orthogonal issues.
http://cr.openjdk.java.net/~goetz/wr19/8221077-hidden_to_frame/

I don't need the webrev as we are discussing the proposed feature and design.

The "Message persistence" section

This section reads like implementation details.

  Yes, I had the feeling that I was talking too much about implementation details,
but mostly in the "Basic algorithm" section.
This section discusses characteristics of NPE that differ from other exceptions
and that were proposed in the course of the review of my
original webrev. They are visible to the Java implementor.
So I think this is just the basic information needed in a JEP.

This is a good start that allows us to start the discussion. This JEP will probably need several iterations of improvement. I'm writing a JEP that I have iterated on many times already, and it makes the JEP and the proposed feature better. Just to share with you that it's part of the process.

I think this section
intends to call out that the message of NPE if constructed by user code
i.e. via public constructor, should not be altered including no-arg
constructor or pass a null message to 1-arg constructor.  The enhanced
NPE message is proposed only when NPE is thrown due to null dereference
when executing bytecode.

Yes.
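The intended behavior for user-constructed exceptions can be written down as a small check. This is a sketch of the expectation stated above, not of the implementation; the class name `ExplicitNpeDemo` and its helper methods are hypothetical.

```java
class ExplicitNpeDemo {
    // NPE created by user code through the no-arg constructor.
    static String messageOfNoArg() {
        return new NullPointerException().getMessage();
    }

    // NPE created by user code through the 1-arg constructor.
    static String messageOfOneArg(String msg) {
        return new NullPointerException(msg).getMessage();
    }

    public static void main(String[] args) {
        // Under the behavior described above, none of these are replaced
        // by a computed message.
        System.out.println(messageOfNoArg());
        System.out.println(messageOfOneArg(null));
        System.out.println(messageOfOneArg("custom"));
    }
}
```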

I don't understand what you tried to say about "npe.getMessage() ==
npe.getMessage()".
Throwable::getMessage does not guarantee to return same String object for
each invocation.

It does not guarantee so, but it's the case for most exceptions. Usually,
one calls super(msg) in an Exception constructor.
It is a design decision to implement it that way, my original proposal
was different. I'm fine with this design, but it should be mentioned I think.
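The difference between the two designs can be shown with two hypothetical exception classes: `Stored` passes its message to `super(msg)`, while `Recomputed` builds the text on every call. Only the first returns the identical `String` object from repeated `getMessage()` calls, although both return equal text.

```java
class MessageIdentityDemo {
    // Stores the message via super(msg), as most exceptions do.
    static class Stored extends RuntimeException {
        Stored(String msg) {
            super(msg);
        }
    }

    // Hypothetical alternative: builds the message anew on every call.
    static class Recomputed extends RuntimeException {
        private final String what;

        Recomputed(String what) {
            this.what = what;
        }

        @Override
        public String getMessage() {
            return new StringBuilder("null value: ").append(what).toString();
        }
    }

    public static void main(String[] args) {
        Stored s = new Stored("msg");
        Recomputed r = new Recomputed("x");
        System.out.println(s.getMessage() == s.getMessage());      // same object
        System.out.println(r.getMessage() == r.getMessage());      // distinct objects
        System.out.println(r.getMessage().equals(r.getMessage())); // equal text
    }
}
```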


I think the JEP should make clearer what you intend to say.

When deserializing, the class bytes may be of a different version, so
StackTraceElements are serialized in their string form.  Serializing NPE with
a computed message seems to be the only viable solution.

I think implementing writeReplace() is a good idea, too. But
Roger does not like it: 
http://mail.openjdk.java.net/pipermail/core-libs-dev/2019-February/058513.html

The JEP should record this, either as an open issue or as a clear proposal to compute the message when NPE is serialized, and say what the client would do (it may be running a different version).
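One way to picture "compute the message when the exception is serialized" is the sketch below. It uses a private `writeObject` hook rather than `writeReplace`, which is an assumption made for brevity, and the class `LazyMessageException` is hypothetical. The transient field stands in for information that the receiving side cannot reconstruct.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;

class LazyMessageException extends RuntimeException {
    private static final long serialVersionUID = 1L;

    // Stands in for data that is not available after deserialization.
    private transient String source;
    // Lazily computed message; this field is serialized.
    private String computed;

    LazyMessageException(String source) {
        this.source = source;
    }

    @Override
    public synchronized String getMessage() {
        if (computed == null && source != null) {
            computed = "derived from " + source;
        }
        return computed;
    }

    // Forces the computation while the source data still exists.
    private void writeObject(ObjectOutputStream out) throws IOException {
        getMessage();
        out.defaultWriteObject();
    }

    // Serializes and deserializes the exception in memory.
    static LazyMessageException roundTrip(LazyMessageException e) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
                out.writeObject(e);
            }
            try (ObjectInputStream in = new ObjectInputStream(
                    new ByteArrayInputStream(bytes.toByteArray()))) {
                return (LazyMessageException) in.readObject();
            }
        } catch (IOException | ClassNotFoundException ex) {
            throw new IllegalStateException(ex);
        }
    }

    public static void main(String[] args) {
        LazyMessageException copy = roundTrip(new LazyMessageException("x"));
        // The copy has no source data, but carries the computed message.
        System.out.println(copy.getMessage());
    }
}
```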

The "Message content" section

I think this section can break down into clear scenarios
1. debug information is present
2. debug information is absent
3. the matching version of bytecode is not available

I agree that the message should be easy for Java developers including the
beginners to understand the error.

Hmm, only printing local variable names is affected by the debug information.
So I don't think a section of its own is necessary.

If the matching bytecode is not available, nothing can be printed.

See my suggestion above to list all known cases that this proposed feature will not cover, i.e. no improved message.

So it does not affect the content of the message. Should I call this
section "Message wording"?  I think this is closer to what I want to
discuss in this paragraph.

You bring up a good question about the exception message format, e.g.
whether to use quotation (none vs. single quote vs. double quote).  It's good
to have a guideline on that, although this is out of scope for this JEP.  IAE
was updated to include the module name and loader name and to improve the
error message so that developers can understand the exception, which is a
good improvement.

IAE quotes the names, which are user-defined strings. It does not quote class
names, method names etc.
Here, I was asked to quote these, which is inconsistent with IAE. But there are
other examples where method names are quoted ... So there is no clear line followed
yet.  In the end, I don't care, but I want advice on how to do it. (I would not
quote ...).


You can add it to the open issues section.

"Alternatives" section

"The current proposal is to implement this in the Java runtime in C++ accessing
  directly the available datastructures in the metaspace."

"Also, ASM and StackWalker do not support the full functionality needed.
StackWalker must be called in the NullPointerException constructor."
This is not true, as you wrote in the subsequent sentence.  If we choose
to implement this in Java, StackWalker will need to be extended to
read Throwable's backtrace.

The sentence above describes the status quo. The current implementation
of StackWalker must be called in the constructor.

I just called out the confusion this section could cause. I think it should be updated or rephrased.

Such specialized form can provide the
ability to fetch the bytecode in Method* if appropriate.  This is
the implementation details.  As discussed in the email thread, another
implementation choice could be a hybrid of native in the VM and Java
to accomplish this task (for example fetching the bytecode of the
current version could be done in the VM if we agree supporting
the redefinition case).

Yes, obviously we can implement a lot of solutions. Instead of changing
StackWalker, we could simply expose [2] 
java_lang_Throwable::get_method_and_bci()
to Java in NPE.  After all, we only need the method which happens to be stored
as top frame in backtrace.


That's easy but implementation-detail.

For logic that is not strictly needed to be done in the VM, doing it
in Java and library is a good option to consider (putting aside the
amount of new code you have to write).   Looks like you can't evaluate
the alternatives since existing library code requires work in order to
prototype this in Java.  So I'd say more investigation needs to be done
to decide on alternative implementation approaches.

"Testing" section

"The basic implementation is in use in SAP's internal Java virtual machine since
2006."

This is good information.  It assumes that the same implementation goes into
the JDK, so it is subject to the discussion and code review.  I think it's
adequate to say that unit tests will need to be developed.

There is a unit test in the webrev.  Please point out test cases you think
are missing. [1] stems from this test.


I am not commenting on [1] but the content of this section.

Please refer to the JEP template (http://openjdk.java.net/jeps/2) about the Testing section.

// What kinds of test development and execution will be required in order
// to validate this enhancement, beyond the usual mandatory unit tests?
// Be sure to list any special platform or hardware requirements.

I think this feature should not impact much of the existing VM code,
and so running existing jtreg tests, JCK tests and other existing test suites
should provide adequate coverage.

I found [1] quite useful for understanding the scenarios this JEP considers.
I think it would be useful to include them in the JEP, perhaps as examples
when describing the data flow analysis.

I thought I picked out the most obvious example, pointer chasing, and used it
in the motivation and the description of the basic algorithm.

[1] can be grouped into several categories, and the JEP can just talk about
the noteworthy groups; it does not need to list all messages. Examples are
dereferencing a null receiver on getfield, and null array element vs. null array.

Yes, I think it makes sense to put more messages into the JEP.
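The "null array element vs. null array" category named above can be described by a scenario rather than by a message. In the sketch below (class and method names are hypothetical), the same source expression `a[0][0]` fails once because `a` itself is null and once because `a[0]` is null.

```java
class ArrayNullDemo {
    // One source expression, two different places where null can occur.
    static int read(int[][] a) {
        return a[0][0];
    }

    // Returns true if read(a) threw a NullPointerException.
    static boolean throwsNpe(int[][] a) {
        try {
            read(a);
            return false;
        } catch (NullPointerException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        // Scenario 1: the outer array reference is null.
        System.out.println(throwsNpe(null));
        // Scenario 2: the outer array exists, but its first element is null.
        System.out.println(throwsNpe(new int[1][]));
    }
}
```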


To be clear:

What I was suggesting is to describe the scenarios in which this proposed feature will improve the message, but *not* to cut-n-paste the messages from [1]. So this is not tied to what message will be printed by the implementation.

Writing the scenarios in English can help in discussing how the NPE messages should be implemented so that they will be easy for Joe developer to understand.

Mandy

But I would
like to do that once it is agreed on the messages, so I don't need to
edit the JEP all the time.
The file referenced is simply generated by the test, so it's easy to update
and sure to reflect the implementation.

This JEP includes a few links to the C++ functions in the current webrev.  I
appreciate that and I assume they will be taken out at some point.

Yes, sure, I can remove that.

Best regards,
   Goetz.
