Re: [polyml] Garbage collection issue with functional input streams

Dave Berry Tue, 02 Sep 2008 13:46:22 -0700

An interesting question! I don't think your interpretation was what we intended - the input function was supposed to return whatever content was available at the time of the call. Given that Andrew and John were the main instigators of the functional IO subsystem and they wrote the SML/NJ version, it would seem safe to go with the SML/NJ behaviour.


Best,


Dave.



At 12:49 01/09/2008, David Matthews wrote:

Philip Clayton wrote:
I have found a performance issue when using TextIO.StreamIO.input1 to read a functional stream. Looking at gc/non-gc times and using PolyML.profiling, it appears that garbage collection accounts for most of the time. There is some code below to demonstrate with stats that include comparison with SML/NJ. The profiling shows that readFromReader in basis/BasicStreamIO.sml is responsible for creating values that are being garbage collected. Looking at this code, I can see various things that would contribute to this garbage collection but nothing that is obviously problematic. Is it simply the case that overheads in the implementation mean that it is not suitable for a large number of small reads?
I think I need to look again at the functional IO part of Poly/ML's basis library.
The idea of functional IO is that a stream should be repeatable. i.e. if a stream, f, has returned some data then re-reading from the stream should return the same data. The definition of functional IO in the basis library that I used when implementing this in Poly/ML had a number of program snippets that implied that it was not just the content that had to be repeatable but also the way the content was broken down. So, if the stream was read using "input1" to return a single character then a subsequent call to "input" on that same functional stream must return precisely one character.
> val str = getInstream(openIn "/tmp/abc");
val str = ? : TextIO.StreamIO.instream
> StreamIO.input1 str;
val it = SOME (#"0", ?)
: (TEXT_STREAM_IO.elem * TextIO.StreamIO.instream) option
> StreamIO.input  str;
val it = ("0", ?) : TEXT_STREAM_IO.vector * TextIO.StreamIO.instream
However, I think when the book was published many of these examples were left out and although it doesn't seem to be stated formally I think the idea is that only the content needs to be repeatable. So "input" should return a string whose first character is the same as that returned by "input1" but whose length is unspecified. This seems to be what SML/NJ at least does. It looks like the problem for your example is that Poly/ML is building up an enormous stream of single character elements and that this is overwhelming the storage management.
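As a rough illustration of the content-only reading of "repeatable", the sketch below reads one character with "input1" and then calls "input" on the same functional stream, checking only that the first character agrees. The sample string and the names f, first, chunk and agrees are made up for the illustration; it uses TextIO.openString so that no file is needed. The check should hold under either interpretation, since it says nothing about the length of the string that "input" returns.

```sml
(* Sketch only: the sample text and all value names here are hypothetical. *)
val f = TextIO.getInstream (TextIO.openString "0123456789")

(* First read: a single character from the functional stream f. *)
val first = TextIO.StreamIO.input1 f

(* Second read from the SAME stream value f, not from the stream
   returned by input1. *)
val (chunk, _) = TextIO.StreamIO.input f

(* Content-only repeatability: the first character of chunk must match
   the character from input1; the length of chunk is left unspecified. *)
val agrees =
    case first of
        SOME (c, _) => size chunk >= 1 andalso String.sub (chunk, 0) = c
      | NONE => chunk = ""
```

An implementation that also preserves how the content was broken down would give a chunk of exactly one character here, while one that only preserves content may return more.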
Although the basis library defines imperative IO in terms of functional IO the implementation in Poly/ML is different so that it doesn't suffer from these problems.
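For comparison, here is a minimal sketch of the two styles of reading one character at a time. It is not the benchmark from the original report, and the function names and sample text are invented for the illustration. The functional loop threads the stream value returned by each TextIO.StreamIO.input1 call, while the imperative loop calls TextIO.input1 on a stream that is updated in place.

```sml
(* Sketch only: countFunctional, countImperative and text are
   hypothetical names, not taken from the original test program. *)

(* Functional interface: each input1 returns the element together with
   the rest of the stream, which is passed to the next call. *)
fun countFunctional (s : TextIO.StreamIO.instream) : int =
    let
        fun loop (s, n) =
            case TextIO.StreamIO.input1 s of
                SOME (_, s') => loop (s', n + 1)
              | NONE => n
    in
        loop (s, 0)
    end

(* Imperative interface: input1 advances the stream as a side effect. *)
fun countImperative (ins : TextIO.instream) : int =
    let
        fun loop n =
            case TextIO.input1 ins of
                SOME _ => loop (n + 1)
              | NONE => n
    in
        loop 0
    end

val text = "hello, world\n"
val nFunctional = countFunctional (TextIO.getInstream (TextIO.openString text))
val nImperative = countImperative (TextIO.openString text)
```

Both loops should count the same number of characters; any difference between them would be in time and allocation, which is what would need measuring on a large input.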
David
_______________________________________________
polyml mailing list
[email protected]
http://lists.inf.ed.ac.uk/mailman/listinfo/polyml


