[Haskell-cafe] Implementing the State Monad (Was: Can every monad can be implemented with Cont?)

apfelmus Sun, 11 Nov 2007 10:10:30 -0800

jeff p wrote:

Didn't someone already prove all monads can be implemented in terms
of Cont?


Cont and StateT, wasn't it?
And the schemers have no choice about running in StateT :)


You sure? I want to see the proof :)


I think this is referring to Andrzej Filinski's paper "Representing
Layered Monads" in which it shown that stacks of monads can be
implemented directly (no layering) by using call/cc and mutable state.

Thanks for the reference! I still don't understand Filinski's papersenough to say whether there's more to the embedding than just


  type Cont m a = forall b . (a -> m b) -> m b

  reify :: Monad m => Cont m a -> m a
  reify f = f return

  reflect :: Monad m => m a -> Cont m a
  reflect x = (x >>=)

Does this already give a performance benefit without further inlining? Idoubt it.

Anyway, all this leads to a fun and easy way to implement monads, forexample the state monad. We have the primitive operations


  type State s a

  get :: State s s
  put :: s -> State s ()

together with the usual monad laws and the operational semantics

  evalState :: State s a -> (s -> a)

  evalState (return x)    = \_ -> x

evalState (get >>= k) = \s -> evalState (k s ) s -- make the states available to k

  evalState (put s >>= k) = \_ -> evalState (k ()) s  -- use a new state  s

Why "operational semantics"? Well, we're just specifying what happenswhen we "execute" a get or put "instruction" by saying how theexecution proceeds with the next instruction k pointed out by >>=.

We're not using the "usual" and "elaborate" type like s -> (a,s) tothread the state around, we're using a humble function s -> a tospecify that a value a depends on some state s . The operationalsemantics will do the state plumbing for us. In other words, the wedon't have to come with a special implementation like s -> (a,s) thatworks, we will *mechanically* get one from our intended operationalsemantics.

Now, how to implement? Well, the best way to start is to represent eachprimitive operation with a new constructor: (GADT notation, needs the-XGADTs flag)


  data State :: * -> * -> * where
    Return :: a -> State s a
    (:>>=) :: State s a -> (a -> State s b) -> State s b
    Get    :: State s s
    Put    :: s -> State s a

With this _term representation_, we can implement the operationalsemantics by

evalState ((m :>>= n) :>>= k) = evalState (m :>>= (n :>>= k)) --monad law associativityevalState (Return x :>>= k) = evalState (k x) --monad law left unitevalState (Get :>>= k) = \s -> evalState (k s ) s --semantics of getevalState (Put s :>>= k) = \_ -> evalState (k ()) s --semantics of putevalState (Return x) = \_ -> x --semantics of returnevalState m = evalState (m :>>= Return) --monad law right unit

Neat, isn't it? Every law and every specification for the primitiveinstructions has been used exactly once.

Simple and painless, but not fully optimized yet. With the firstequation, using :>>= left-associatively has similar problems like using++ left-associatively. Both


  concat'    = foldl (++) []
  sequence_' = foldl (>>) (return ())

would show quadratic time behavior. So, just like with difference lists,the idea is to represent the operations in the monad together with the_context_ they are commonly used in. For concatenating lists, the context is


  (xs ++ _)

so that every list  xs  is represented by

  \ys -> (xs ++ ys)

For our state monad, the context is

  evalState (m :>>= _)

i.e., we make evaluation and sequencing "built-in". More specifically,we also make the evaluation of the next instructions built-in, so thatevery monadic action m will be represented by


  \k -> evalState (m :>>= k')  where  k = evalState . k'

In other words, we will represent the type  State s a  by

  State' s a = forall b . (a -> (s -> b)) -> (s -> b)

But this is just the continuation monad!

  data Cont   m a = Cont (forall b . (a -> m b) -> m b)
  type State' s a = Cont (s ->) a

with m a = s -> a the result type of our semantics evalState . So, wealready get >>= and return for free, knowing that they fulfill themonad laws


  return x = \k -> k x
  m >>= f  = \k -> m (flip f k)

Our custom primitive operations get and put are straightforward toimplement


  get   = \k -> evalState (Get :>>= k')
        = \k -> \s -> evalState (k' s ) s
        = \k -> \s -> k s s

  put s = \k -> \_ -> k () s

These definitions are crystal clear from their operational semantics,given some practice reading them. Last but not least, there isevalState which implements the behavior of the Return instruction


  evalState m
     = evalState (m' :>>= Return)
     = (\k -> evalState (m' :>>= k')) (evalState . Return)
     = m (\x -> evalState (Return x))
     = m (\x -> \_ -> x)

That's it for the state monad. For reference, here's the full implementation

  type State s a = Cont ((->) s) a

  get         = \k s -> k s  s
  put s       = \k _ -> k () s
  evalState m = m const

We get >>= and return for free from the predefined Cont (assumingthat the "done right"-version with universal quantification would be inthe libraries, that is).

Of course, this approach isn't limited to the state monad. Here are someparser combinators


  type Result a = String -> [a]
  type Parser a = Cont (Result) a

  run  p  = p (\x i -> if null i then [x] else [])

  symbol  = \k i -> case i of { c:cs -> k c; [] -> []; }
  fail    = \k i -> []
  p +++ q = \k i -> p k i ++ q k i

Can you see the operational semantics? (Think of p k as run (p >>=k)). If not, stick to the term implementation and check out Unimo belowfor a free >>= :)

This simple way of implementing monads by their operational semantics isknown for quite some time


  John Hughes. The Design of a Pretty-printing Library.
  http://citeseer.ist.psu.edu/hughes95design.html

  Chuan-kai Lin. Programming Monads Operationally with Unimo.
  http://web.cecs.pdx.edu/~cklin/papers/unimo-143.pdf

and is in fact related to the good old continuation passing style of IOand parser combinators. But I think it's powerful and I'd like it to bewell-known.



Regards,
apfelmus

PS: Put differently, the question of the original thread "Can everymonad can be implemented with Cont?" is whether Unimo can implementstrictly more monads than Cont.


_______________________________________________
Haskell-Cafe mailing list
[email protected]
http://www.haskell.org/mailman/listinfo/haskell-cafe

[Haskell-cafe] Implementing the State Monad (Was: Can every monad can be implemented with Cont?)

Reply via email to