Refined type checking for GADTs (was: Pattern matching: next steps after JEP 405)

Brian Goetz Tue, 24 May 2022 11:56:54 -0700


 - *Refined type checking for GADTs. *Given a hierarchy like:

    sealed interface Node<T> { }
    record IntNode(int i) implements Node<Integer> { }
    record FloatNode(float f) implements Node<Float> { }

we currently cannot type-check programs like:

    <T> Node<T> twice(Node<T> n) {
        return switch (n) {
            case IntNode(int x) -> new IntNode(x*2);
            case FloatNode(float x) -> new FloatNode(x*2);
       }
   }

because, while the match constraints the instantiation of T in eacharm of the switch, the compiler doesn't know this yet.

Much of this problem has already been explored by "Generalized AlgebraicData Types and Object Oriented Programming" (Kennedy and Russo, 2005);there's a subset of the formalism from that paper which I think canapply somewhat cleanly to Java.

The essence of the approach is that in certain scopes (which coincideexactly with the scope of pattern binding variables), additional _typevariable equality constraints_ are injected. For a switch like thatabove, we inject a T=Integer constraint into the first arm, and aT=Float into the second arm, and do our type checking with theseadditional constraints. (The paper uses equational constraints only(T=Integer), but we may want additional upper bounds as well (T <:Comprable<T>)).

The way it works in this example is: we gather the constraint Node<T> =Node<Integer> from the switch (by walking up the hierarchy and doingsubstitution), and unifying, which gives us the new equationalconstraint T=Integer. We then type-check the RHS using the additionalconstraints.

The type checking adds some new rules to reflect equational constraints,FJ-style:


   \Gamma |- T=U   \Gamma |- C<T> OK
   --------------------------------- abstraction
       \Gamma |- C<T> = C<U>

   \Gamma |- C<T> = C<U>
   --------------------- reduction
       \Gamma |- T=U

   \Gamma |- X OK
   --------------  reflexivity
   \Gamma |- X=X

   \Gamma |- U=T
   -------------  symmetry
   \Gamma |- T=U

   \Gamma |- T=U  \Gamma |- U=V
   ----------------------------  transitivity
   \Gamma |= T=V

    \Gamma |- T=U
   ---------------- subtyping
   \Gamma |- T <: U

The key is that this only affects type checking; it doesn't rewrite anytypes. Since in the first arm we are trying to assign a IntNode to aNode<T>, and IntNode <: Node<Integer>, by symmetry + subtyping, we getIntNode <: Node<T>, and yay it type-checks.


The main moving parts of this sub-feature are:

- Defining scopes for additional constraints/bounds. This canpiggyback on the existing language of the form "if v is introduced whenP is true, then v is definitely matched at X"; we can trivially extendthis to say "a constraint is definitely matched at X". This is almostpurely mechanical. - Defining additional type-checking rules to support scope-specificconstraints, along the lines above, in 4.10 (Subtyping). - In the description of type and records patterns (14.30.x), appeal toinference to gather equational constraints, and which patterns introducean equational constraint.


This is obviously only a sketch; more details to follow.

Refined type checking for GADTs (was: Pattern matching: next steps after JEP 405)

Reply via email to