Re: [Haskell-cafe] Re: Proposal: Sum type branches as extended types (as Type!Constructor)

wren ng thornton Mon, 07 Jun 2010 11:40:47 -0700

Gabriel Riba wrote:

New proposal draft:


Proposal: Type supplement for constructor specific uses of sum types

Purpose: Avoid error clauses (runtime errors), exception control or Maybe
 types in partially defined (constructor specific) functions on sum types.

As an example, with

data List a = Nil | Cons a (List a)* Actual system, with runtime errors (as in GHC Data.List head) orexception throwing or optional Maybe results.


   hd :: List a -> a
   hd (Cons x _) -> x
   hd Nil -> error "error: hd: empty list" -- error or exception throwing

* Proposed system extending types with a suffix @ Constructor or @
{Constructor1, Constructor2, ..}

   hd :: List @ Cons a -> a
   hd (Cons x _) = x

The caller must do pattern matching before applying the constructor-specific
function.

Since the goal is to propagate case analysis information, the syntaxshould reflect that. That is, there should be support for nestedpatterns, e.g.,


    cdar :: [...@{_:_:_} -> a
    cdar (_:x:_) = x

    cadr :: [[...@{(_:_):_} -> a
    cadr ((_:xs):_) = xs

    headFromJust :: [Maybe a...@{just _ : _} -> a
    ...

The t...@cons syntax is a nice shorthand, but we need to express thearguments to the data constructors in the extended syntax in order tosupport nested patterns.

For delimiting multiple alternatives, it's not clear that comma is thebest delimiter to use, especially since it could be the data constructorfor tuples. Perhaps using ; or | would be better. Unless there's asyntactic reason for preferring braces over parentheses, perhaps weshould just use parentheses for symmetry with as-patterns.

Finally, there should also be support for negative patterns, i.e.,propagation of the failure to match a pattern. One place this is usefulis for distinguishing 0 from other numbers, which allows removing theerror branches from functions like division. Sometimes we can'tenumerate all the positive patterns we want to allow, but it's easy toexpress what should be disallowed.

To match case analysis we should allow for a conjunction of negativepatterns followed by a positive pattern. Or, if we want to incorporatemultiple positive patterns, then a conjunction of negative patternsfollowed by a disjunction of positive patterns. (Disjunctive casematching has been an independent proposal in the past, and there'snothing prohibiting supporting it.)

Thus, if we use | to delimit disjunctions, & to delimit conjunctions,and \\ to separate the disjuncts from the conjuncts, given the followingcase analysis:


    case x : T of
    p1 y1... -> e1
    p2 y2... -> e2
    _        -> eF

The variable x has type T outside of the case expression. Within thebranch e1 it is given the refinement type t...@{p1 _...} where variablesbound by the pattern are replaced with wildcards. In branch e2 it isgiven the refinement type t...@{p2 _... \\ p1 _...}. This can be simplifiedto t...@{p2 _...} if the head constructors of p2 and p1 are distinct. Andin the eF branch x would be given the refinement type t...@{_ \\ p1 _... &p2_...}.

If this semantics is too hard to implement, we could instead require theuse of as-patterns for introducing the refinements. The variableintroduced by @ in the as-pattern would be given the refinement type,but the scrutinee would continue to have the unrefined type. This lattersemantics is common in dependently typed languages, but it's verbose andugly so it'd be nice to avoid it if we can.


Other notes:

Case matching on non-variable expressions would gain no extra support,since we have no variable to associate the refinement information with(unless as-patterns are used).

A refinement type t...@{p1} can always be weakened to t...@{p1 | p2}.Similarly, a refinement type can always be weakened by erasing it.

For type inference, I'd suggest that functions which do not have rigidsignatures are treated the way they currently are; that is, allrefinement information is weakened away unless it is explicitlyrequested or returned by a function's type signature. This could beimproved upon later, but seems like the most reasonable place to start.One complication of trying to infer refinement types is that if we aretoo liberal then we won't catch bugs arising from non-exhaustive patternmatching.

Syntax-wise, there's no particular reason for distinguishing differencefrom conjunctions under difference. That is, the type t...@{... \\ p1 & p2}could just as well be written t...@{... \\ p1 \\ p2}. And there's no needfor conjunctions under disjunction because we can unify the patterns toget their intersection. Thus, it might be best to just have disjunctionand difference for simplicity.


--
Live well,
~wren
_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe

Re: [Haskell-cafe] Re: Proposal: Sum type branches as extended types (as Type!Constructor)

Reply via email to