Re: [Haskell-cafe] What unsafeInterleaveIO is unsafe

wren ng thornton Sun, 15 Mar 2009 18:04:53 -0700

Yusaku Hashimoto wrote:

Hello,


I was studying about what unsafeInterleaveIO is.I understood
unsafeInterleaveIO takes an IO action, and delays it. But I couldn't
find any reason why unsafeInterleaveIO is unsafe.

I have already read an example in
http://www.haskell.org/pipermail/haskell-cafe/2009-March/057101.html
says lazy IO may break purity, but I think real matter in this example
are wrong use of seq. did I misread?

For example: I have some universal state in IO. We'll call it an IORef,but it could be anything, like reading lines from a file. And I havesome method for accessing and updating that state.


> next r = do n <- readIORef r
>             writeIORef r (n+1)
>             return n

Now, if I use unsafeInterleaveIO:

> main = do r <- newIORef 0
>           x <-  do a <- unsafeInterleaveIO (next r)
>                    b <- unsafeInterleaveIO (next r)
>                    return (a,b)
>           ...

The values of a and b in x are entirely arbitrary, and are only set atthe point when they are first accessed. They're not just arbitrarybetween which is 0 and which is 1, they could be *any* pair of values(other than equal) since the reference r is still in scope and othercode in the ... could affect it before we access a and b, or between thetwo accesses.

The arbitrariness is not "random" in the statistical sense, but ratheris an oracle for determining the order in which evaluation has occurred.Consider, as an illustration these two alternatives for the ...:


>           fst x `seq` snd x `seq` return x

vs

>           snd x `seq` fst x `seq` return x

In this example, main will return (0,1) or (1,0) depending on which waschosen. You are right in that the issue lies in seq, but that's a redherring. Having made x, we can pass it along to any function, ignore theoutput of that function, and inspect x in order to know the order ofstrictness in that function.

Moreover, let's have two pure implementations, f and g, of the samemathematical function. Even if f and g are close enough to correctlygive the same output for inputs with _|_ in them, we may be able toobserve the fact that they arrive at those answers differently bypassing in our x. Given that such observations are possible, it is nolonger safe to exchange f and g for one another, despite the fact thatthey are pure and give the same output for all (meaningful) inputs.

This example is somewhat artificial because we set up x to useunsafeInterleaveIO in the bad way. For the intended use cases where itis indeed (arguably) safe, we would need to be sure to manually threadthe state through the pure value (e.g. x) such that the final value issane. For instance, in lazy I/O where we're constructing a list oflines/bytes/whatever, we need to ensure that any access to the Nthelement of the list will first force the (N-1)th element, so that weensure that the list comes out in the same order as if we forced all ofthem at construction time.

For things like arbitrary symbol generation, unsafeInterleaveIO isperfectly fine because the order and identity of the symbols generatedis irrelevant, but more importantly it is safe because the "IO" that'sgoing on is not actually I/O. For arbitrary symbol generation, we coulduse unsafeInterleaveST instead, and that would be better because itaccurately describes the effects. For any IO value which has real I/Oeffects, unsafeInterleaveIO is almost never correct because the orderingof effects on the real world (or whether the effects occur at all)depends entirely on the evaluation behavior of the program, which canvary by compiler, by compiler version, or even between different runs ofthe same compiled binary.


--
Live well,
~wren
_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe

Re: [Haskell-cafe] What unsafeInterleaveIO is unsafe

Reply via email to