Re: [Haskell-cafe] Musings on type systems

wren ng thornton Fri, 19 Nov 2010 22:27:59 -0800

On 11/19/10 10:05 PM, Ryan Ingram wrote:

On Fri, Nov 19, 2010 at 1:05 PM, Andrew Coppin wrote:

So is Either what is meant by a "sum type"?
Similarly, (X, Y) [...] is this a "product type"?

Yes and no. Unfortunately there's some discrepancy in the terminology depending on who you ask. In the functional programming world, yes: sum types are when you have a choice of data constructors, a la Either; and product types are when you have multiple arguments in a data constructor, a la tuples.[1]
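
To make the functional programmer's reading concrete, here is a minimal sketch. The Shape type and the two conversion functions are made up for illustration; only Either and tuples come from the discussion. Ignoring bottoms (see [1]), Shape carries the same information as a sum of a Double and a product of two Doubles.

```haskell
-- Hypothetical example type: a choice of constructors (a sum), where one
-- constructor takes several arguments (a product).
data Shape
  = Circle Double        -- first alternative: a single field
  | Rect Double Double   -- second alternative: a product of two fields
  deriving (Show, Eq)

-- Either is the generic binary sum and (,) is the generic binary product,
-- so Shape can be translated to and from Either Double (Double, Double).
toGeneric :: Shape -> Either Double (Double, Double)
toGeneric (Circle r) = Left r
toGeneric (Rect w h) = Right (w, h)

fromGeneric :: Either Double (Double, Double) -> Shape
fromGeneric (Left r)       = Circle r
fromGeneric (Right (w, h)) = Rect w h
```

Loaded into GHCi, fromGeneric (toGeneric s) gives back s for every fully defined Shape, which is the sense in which the two types correspond.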

However, in set theory and consequently in much of the research on dependent types, (the dependent generalization of) function types are called ``(dependent) product types'' and (the dependent generalization of) tuples are called ``(dependent) sum types''. There's a convoluted story about why this supposedly makes sense, but it doesn't match functional programmers' terminology nor the category theoretic terminology which is often invoked in type theory.

ObTangent: this is much like the discrepancy between what is meant by ``source'' and ``target'' for folks who come from a machine learning background vs people who come from a signal processing background. Thankfully, most of the NLP folks caught in the middle have decided to go with the sensible (ML) definitions.

[1] In a lazy language like Haskell we have to be careful about how we phrase this. There are different notions of products[2] depending on how they behave with respect to strictness, and depending on which one you choose you'll change how you have to reason about the types abstractly. This shows up canonically in the difference between domain products and smash products. When Haskell was designed they decided not to have two different versions of products in the language, so the tuples in Haskell aren't either of these two well-behaved kinds of products. This has ramifications when people try to reason about which program transformations are valid without introducing too much or too little laziness. By and large Haskell's tuples and ADTs are good at doing what you mean, but they do complicate the theory.

[2] The same is true for different kinds of sums, but that's less problematic to deal with.
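
A rough sketch of what footnote [1] is getting at, assuming GHC and Control.Exception from base. The names isDefined, bottomPair, pairOfBottoms and halfDefined are invented for this example. The point is only that Haskell can tell an undefined pair apart from a pair of undefined components, and that one undefined component does not make the whole pair undefined.

```haskell
import Control.Exception (SomeException, catch, evaluate)

-- Returns True if forcing the argument to weak head normal form succeeds,
-- and False if doing so throws (as forcing `undefined` does).
isDefined :: a -> IO Bool
isDefined x = (evaluate x >> pure True) `catch` handler
  where
    handler :: SomeException -> IO Bool
    handler _ = pure False

-- The pair itself is bottom.
bottomPair :: (Int, Int)
bottomPair = undefined

-- A perfectly good pair constructor whose two fields are bottom.
pairOfBottoms :: (Int, Int)
pairOfBottoms = (undefined, undefined)

-- One bottom field does not collapse the pair: the other field is usable.
halfDefined :: (Int, Int)
halfDefined = (undefined, 1)
```

In GHCi, isDefined bottomPair and isDefined pairOfBottoms give different answers, so the two are distinct values, whereas a product that identifies bottom with the pair of bottoms would not distinguish them. Likewise snd halfDefined is still 1, which a smash product would not allow.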

Notionally (->) is just another
type constructor, so functions aren't fundamentally different to any other
types - at least, as far as the type system goes.


Sort of, but I think your discussion later gets into exactly why it
*is* fundamentally different.

There are a few different ways to think about functions/arrows, which is why things get a bit strange. In functional programming this is highlighted by the ideas of ``functions as procedures'' vs ``functions as data'' ---even though we like to ignore the differences between those two perspectives. In category theoretic terms, those ideas correlate with morphisms vs exponential objects (or coexponential objects, depending). There's a category theoretic relation between exponentials and products (i.e., tuples) which is where un/currying comes from. But this is also why the Pi- and Sigma-types of dependently typed languages cause such issues.

For example, there's an isomorphism between A->(C^B) and (A*B)->C in certain categories, namely curry/uncurry. And there's also an isomorphism between A*B and B*A, namely swapping the elements of a pair. Together these mean, A->(C^B) ~= (A*B)->C ~= (B*A)->C ~= B->(C^A). In Haskell this is obviously true because we have Prelude.flip. However, if we generalize this to dependent functions and dependent pairs then it's no longer true in general, because B may require an A to be in scope in order to be well-kinded; e.g., assuming f : (a:A) -> (b: B a) -> C a b, then what is the type of swap f?
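
The chain of isomorphisms can be walked through step by step in plain Haskell. This is just a sketch: flipViaPairs is a name made up here, while curry, uncurry and flip are from the Prelude and swap is from Data.Tuple.

```haskell
import Data.Tuple (swap)

-- Follow  A -> (B -> C)  ~=  (A, B) -> C  ~=  (B, A) -> C  ~=  B -> (A -> C):
--   uncurry f           :: (a, b) -> c   -- uncurrying
--   uncurry f . swap    :: (b, a) -> c   -- swapping the pair
--   curry (...)         :: b -> a -> c   -- currying again
flipViaPairs :: (a -> b -> c) -> (b -> a -> c)
flipViaPairs f = curry (uncurry f . swap)
```

For ordinary (non-dependent) functions this agrees with Prelude.flip; for instance both flipViaPairs (-) 1 10 and flip (-) 1 10 compute 10 - 1. The dependent case has no such definition in general, because the swapped type would have to mention a before it is bound.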

So on the one hand arrows and products are just type constructors like any other, but on the other hand they're not. It's sort of like how zero and one are natural numbers, but they're specialer than the other natural numbers (you need them in order to define the rest of Nat; they have special behavior with respect to basic operations like (+),(*),...; etc).

ObTangent: When we dualize things to co-Cartesian closed categories we get the same thing, except it's between sums/coproducts and coexponentials.

Where *the hell* do GADTs fit in here? Well, they're usually used with
phantom types, so I guess we need to figure out where phantom types fit in.


Well, I find it's better to think of GADTs as types that have extra
elements holding proofs about their contents which you can unpack.

Ultimately, GADTs are just a restricted form of Pi- and Sigma-types. The type argument whose value varies depending on the constructor isn't actually a phantom type. You can think of there being four sorts of type variables. There are the variables for parametric polymorphism where the _same_ variable occurs on both sides of the = defining the type. There are phantom types where the variable only occurs on the left. There are existential types where the variable only occurs on the right. And there are dependent types which are like a combination of a phantom variable on the left, an existential variable on the right, and an equality constraint relating the left variable to the right variable.
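
The four sorts of type variables can be sketched in GHC Haskell as follows. All the type and function names here (Box, Distance, Showable, Expr, describe, eval) are invented for illustration, and the sketch assumes the ExistentialQuantification and GADTs extensions.

```haskell
{-# LANGUAGE ExistentialQuantification, GADTs #-}

-- Parametric: the same `a` occurs on both sides of the `=`.
data Box a = Box a

-- Phantom: `unit` occurs only on the left; no field mentions it.
newtype Distance unit = Distance Double

-- Existential: `a` occurs only on the right, hidden inside the constructor.
data Showable = forall a. Show a => MkShowable a

describe :: Showable -> String
describe (MkShowable x) = show x

-- GADT: the index `t` is pinned down per constructor. In GHC the signature
-- `IntE :: Int -> Expr Int` can be read as `(t ~ Int) => Int -> Expr t`,
-- i.e. a left-hand variable plus an equality constraint.
data Expr t where
  IntE  :: Int  -> Expr Int
  BoolE :: Bool -> Expr Bool
  If    :: Expr Bool -> Expr t -> Expr t -> Expr t

-- Matching on a constructor brings its equality into scope, which is why
-- each branch may return a differently typed result.
eval :: Expr t -> t
eval (IntE n)   = n
eval (BoolE b)  = b
eval (If c t e) = if eval c then eval t else eval e
```

The eval function is where the "unpackable proof" reading shows up: matching on IntE tells the type checker that t is Int in that branch, so returning n type-checks.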


--
Live well,
~wren
