An implication of universal generics is that there needs to be
    some common protocol that works on both vals and refs. In the
    val/ref model, that protocol is objects: both vals and refs are
    objects with members that can be accessed via '.'. In the
    value/object model, I'm not quite sure how you'd explain it. Maybe
    there's a third concept here, generalizing how values and objects
    behave.


This is on point. I quite honestly forgot that "oh yeah, I don't fully understand universal generics yet", and I'll go work on that. It might be death to the model I'm clinging to, but in that case I'll become pretty good at explaining to people why that model fails, so cool.


Generics are often a clarifying lens through which to look at this problem.  We've caught ourselves multiple times trying to locally optimize, only to find that doing so is an impediment to "generify over all the things."  One of the arguments in favor of "everything is an object" (or a class, or whatever), aside from its natural uniformity, is that then generics have a more regular surface to quantify over; generifying over all types is easier when the types have more in common.

For example, one of the reasons to allow the locution `String.ref` as an alias for `String`, useless though it is, is that it strengthens the notion that `.ref` is a total operator, so `T.ref` makes sense simply by appealing to substitution, rather than having to give it a more elaborate definition.
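
To make the substitution argument concrete, here is a hypothetical illustration (`Box` and its field are made up for this example; the `T.ref` surface syntax is from the Valhalla drafts and is not valid Java today, hence it appears only in comments):

```java
// Hypothetical surface syntax, shown in comments:
//
//   class Box<T> {
//       T.ref orNull;    // total under substitution:
//   }                    //   T = Point  -> Point.ref (the nullable projection)
//                        //   T = String -> String.ref, i.e. just String
```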

When considering universal (erased) generics, we had to totalize the semantics of all operations, even when some operations are not allowed under a strict-substitution interpretation.  A quick tour (assume `t` is of type `T`, an unbounded type variable, which is instantiated to `Point`.)  A code sketch tying these operations together follows the list.

 - Assignment to Object or interface (`Object o = t`).  In the language, this is considered a primitive widening (née boxing) conversion, but in the VM, this is mere subtyping (QFoo is-a LFoo).  This means that we can use the same `astore` or `putfield` operations to simply move the value without conversion.

 - Assignment of null (`T t = null`).  Not all types under T are nullable, but T is still erased to Object.  In this case, we assign the null and issue an unchecked warning; if that value bubbles out to non-generic code, the cast to `Point` will catch the null, and we treat this as a form of heap pollution.

 - Array covariance (`Object[] os = ts`).  The JVM has been upgraded to support array covariance for primitives, where `Point[] <: Point.ref[]` (and transitivity gets us to `Object[]`.)

 - Synchronization (`synchronized(t)`).  Warnings at compile time, `IllegalMonitorStateException` (IMSE) at runtime.

 - Equality (`o == t`).  ACMP has been upgraded to understand primitives, so we can translate it as we always have.
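
To tie the tour together, here is a sketch of a generic class exercising each of the operations above.  It compiles as ordinary Java today (`Tour` is a made-up name); the comments record the universal-generics behavior described in the bullets, assuming `T` is instantiated to `Point`:

```java
// Sketch: each statement corresponds to one bullet in the tour above.
class Tour<T> {
    void tour(T t, T[] ts, Object o) {
        Object obj = t;         // widening to Object; in the VM this is mere
                                // subtyping (QPoint is-a LPoint), so a plain
                                // astore/putfield moves the value unconverted
        T u = null;             // unchecked warning: T may be instantiated to
                                // a non-nullable type; an escaping null is
                                // caught later by the cast to Point (a form
                                // of heap pollution)
        Object[] os = ts;       // array covariance: Point[] <: Point.ref[],
                                // and transitivity gets us to Object[]
        synchronized (t) {      // warning at compile time; IMSE at runtime
            // ...              // if T is instantiated to Point
        }
        boolean eq = (o == t);  // acmp understands primitives, so == is
                                // translated exactly as before
    }
}
```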

I'm sure I missed a few, but what you see here is a bag of tricks for creating totality.  In some cases (equality, array covariance) we engineered actual totality into the bytecodes; in some cases (synchronization) we rely on compile time warnings and runtime errors; in others, we rely on erasure and lean on existing detection of heap pollution.

When moving forward to specialized generics, the constraints get stiffer.  We want a model where the _bytecode_ is invariant across specializations, all specialization operates on the constant pool, and specialization is strictly optional at runtime (meaning erasure is still a valid runtime strategy.)  This might mean that some total-seeming operations (e.g., `T.default`) are either outlawed or require complex translation through a reflective runtime.
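
As a purely illustrative sketch of what "translation through a reflective runtime" could look like for `T.default`: today's reflection can already manufacture a default value by round-tripping through an array, though the `Class<T>` token is an extra parameter that specialization (or the caller) would have to thread through.  The method below is made up for illustration, not a proposed API:

```java
import java.lang.reflect.Array;

class Defaults {
    // Allocate a one-element array of the runtime class and read back its
    // zero-initialized element: null for references, zero for primitives.
    // A specialized runtime could do the analogous thing for primitive
    // classes like Point.
    @SuppressWarnings("unchecked")
    static <T> T defaultValue(Class<T> clazz) {
        return (T) Array.get(Array.newInstance(clazz, 1), 0);
    }
}
```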

All of this is to say, there may be some hidden indirect constraints that derive from the desire for a uniform but still specializable translation.

