ParameterizedType encoding (was: Model 3 classfile design document)

Brian Goetz Thu, 24 Mar 2016 14:48:06 -0700

The complex structure you refer to exists in two places:

- The GenericClass attribute, which describes the structure of thetype variables;

 - The ParameterizedType attribute, through its 'enclosing' field.

The structure of these two need to line up, which opens up various sortsof possibility for error, which must be checked at runtime. So, why didI propose this, and not just flattening the tvars into a linear array?

The proposed form is derived from some (presumed) compatibilityrequirements. These leak in because, even though Outer<T> and Innerare derived from the same source file (and presumably therefore theirclassfiles will always be consistent), *other* classfiles can describeOuter<T>.Inner using ParamType, and its possible that Outer/Inner canbe modified without recompiling the client.

Let's make these compatibility requirements more explicit. In theabsence of qualification, "compatible" means "binary and sourcecompatible for clients and subclasses."

1. Renaming a type variable should be compatible. The rationale hereis, the choice of parameter names is an implementation detail, and itsreasonable to treat the choice of type variable names the same way. Weaccomplish this by encoding type variable uses with indexes instead ofnames -- but this depends on the stability of indexes.

2. (Non-requirement.) We don't require that reordering or removingtype variables be compatible. While it would be nice if we could dothis, the number and order of type variables is part of a classesinterface definition. A Map<K,V> is a map from K to V; the order isfixed when we first publish the class.

Note that (1) and (2) match the story for method argument lists today;you can compatibly rename parameters, but not remove or reorder them.(See below for adding.)

3. Anyfying an existing erased type variable should be compatible; Ishould be able to evolve class Foo<T,U> to be class Foo<any T, U>. Therationale is obvious; if this were not the case, we couldn't anyfy anyof our existing libraries.

So far, nothing too controversial. Now, let's move on to some "would benice" evolution cases.

4. Generifying a non-generic class by adding one or more erased typevariables should be (at least binary) compatible. This means that if wehave Outer.Inner, it would be nice if we could evolve this toOuter<T>.Inner. And similarly, if we have Outer<T>.Inner, it wouldbe nice if we could evolve this to be Outer<T>.Inner.


(Note that #4 + #3 means we can also add any-tvars too.)

This is the case with existing erased generics; so long as we followsome rules, we can generify existing classes without breaking clients.But, adding type variables somewhere in the chain has the potential toperturb the numbering scheme for type variables, conflicting with #1above. Note that whether we start numbering outer-to-inner, orinner-to-outer, a continuous numbering system will be perturbed by oneof the above scenarios.

5. (Weak requirement.) It would be nice if adding new erased typevariables to the end of the type variable argument list were also binarycompatible, as it is today with erased generics. This also has thepotential to perturb continuous numbering schemes.

Given these requirements, gleaned from existing compatibility behaviors,nudged us in the direction of explicitly modeling Outer<T>.Inner as achain of parameterized type descriptors, rather than one flatteneddescriptor. When we encounter a PT, we match its structure (loosely, toaccount for #4 and #5) to the GenericClass descriptor for the describedtemplate class, as well as validating type parameters (e.g., that wehaven't passed "I" to an erased type parameter.)

Alternately, we could have encoded type variables as (owner, indexwithin owner) pairs. But, even with such an encoding, we would have togo through a similar validation process as above; this changes therepresentation, but not the amount of work.

Alternately alternately, we could encode a snapshot of theas-of-compile-time contents of the GenericClass, so that separatecompilation changes can be detected and possibly corrected. But thisseems overkill.








On 3/22/2016 4:21 PM, John Rose wrote:

The full display of type variables, with all their definition sites, strikes me 
as clunky, from a VM perspective. It's a large amount of AST info.

For inner classes, we flatten up level references by introducing synthetic 
variables and fields. In a few places core reflection needs an attribute to map 
backward but the executable part is all flattened. This makes it easier to 
execute and compile.

Could we do a similar trick for type variables?  I.e. represent up-level type 
vars as a flat sequence of synthetic local copies.

– John

On Feb 11, 2016, at 2:24 PM, Bjorn B Vardal <[email protected]> wrote:

where Inner doesn't declare any type variables, my understanding is that Inner 
will still have the GenericClass attribute because it may refer to T. Will 
Inner still appear as the first class frame, with tvarCount=0, enforcing the 
rule that the first element is always the class itself?

ParameterizedType encoding (was: Model 3 classfile design document)

Reply via email to