Re: Model 3 classfile design document

Brian Goetz Wed, 17 Feb 2016 09:09:58 -0800

Having discussed the classfile representation and sketched out someplausibility arguments about how the VM can efficiently managespecialization, let's step back and look at the consequences for whatthis means for the language (both Java and other languages.)

Type -> Class mapping. With erased generics, all parameterizationsFoo<T> map to a single class Foo. In the Model 3 model, the classfilefor Foo is essentially a template; we can request parameterizations ofFoo via the ParamType constant (the Class constant Class[Foo] becomesretconned to mean ParamType[Foo, erased].)

Reflection. In the current prototype, Foo<int> and Foo<String> aredistinct classes; each will respond with distinct .getClass() results.We don't yet have a means to express that Foo<int> and Foo<String> aredifferent "species" of Foo; instead each get their own class mirror.Reflective operations like Foo<int>.class.getName() currently yield uglyresults. Lots of open questions here.

Reification. The question on everyone's mind will be: are we "finallygetting reified generics"? And the answer is: sort of. (This questionalso comes with a lot of baggage; there are a lot of people who assumethat erasure is somehow "smelly" and therefore bad, and so of coursereification must be better. But erasure is a pragmatic compromise, andthe alternative is not always better. Let's try and leave the baggageat the door for now.)

To add to the confusion, not everyone means the same thing by "reifiedgenerics". To some, reification means "types are checked at runtime";to others, it may merely mean "types are reflectively available atruntime." Even within the first category, there's a range of what sortof type checking we might mean, since the VM type system may not beexactly the same type system as the language-level type system -- andfor good reason. (What if we ask for a reified ArrayList<? extendsList<? extends Foo> & Serializable>? Do we get runtime subtype checkingfor wildcards and intersections every time we try to put something inthis List? Would we even want that? Are we sure such checks aredecidable?)

In Model 3, specialization is clearly a form of reification; when wespecialize ArrayList to E=int, the backing store is an int[], andtherefore we get all the type checking that entails. We can clearlylayer additional support for reflectively exposing the bindings of typeparameters in a number of ways.

The Model 3 classfile design explicitly admits both reified and erasedgenerics at the VM level, by allowing a concrete type descriptor *or*the 'erased' token as a type parameter to a ParameterizedType. (Notethat 'erased' is not a type, it is merely an allowed typeparameterization -- similar to wildcards in in the Java language.)There is nothing in the classfile design that encodes the rule"reference parameterizations are erased"; that's the choice of thelanguage compiler. In this way, we can consider any non-erasedparameterization to be reified; a ParamType[ArrayList, LString] willthrow ArrayStoreException at runtime if you try to cram something otherthan a String into it.

So, does that mean generics are reified? Sort of... For multiplereasons (including, but not exclusively compatibility), the current planis for the Java language to continue to use erasure for referenceparameterizations of generics. But other languages are free to use fullreification where it suits them (and if their Java interop requirementslet them.) If someone uses reflection to reflect over a List<String>and ask for its type parameter, it will come back as "erased"(reflection has to support this answer anyway, if only for compatibilitywith legacy code.)

So the punchline is, at the Java language, generics are erased *and*reified; generics over references are erased (as they are today) andgenerics over values are reified. I suspect people will be about asjarred by this as they were by erasure in the first place; I expectwe'll get some degree of "You idiots, you ran 99 yards only to fumblethe ball on the 1 yard line." But looking past this (which is mostlythe above-mentioned baggage), the model seems sound enough; existingreference generics work as they always have, and new value generics work"better" (in that there are additional things you can do with them.)

In fact, it gives us a chance to be more honest about erasure, because"erased" can appear as a first-class member of the programming model. Ibelieve much of the complaints about erasure stem from the fact that itis inevitably a surprise when you first discover it.






On 1/22/2016 11:52 AM, Brian Goetz wrote:

Please find a document here:
http://cr.openjdk.java.net/~briangoetz/valhalla/eg-attachments/model3-01.html
that describes our current thinking for evolving the classfile formatto clearly and efficiently represent parametric polymorphism. Theearly concepts of this approach were outlined in my talk at JVMLS lastyear; this represents a refinement of those ideas, and a reasonable"stake in the ground" description of what seems the most sensible wayto balance preserving parametric information in the classfile withoutimposing excessive runtime costs for loading specializations.

Re: Model 3 classfile design document

Reply via email to