Re: [polyml] extremely bad performance on some mllex generated code

David Matthews Tue, 03 Mar 2015 05:49:47 -0800

An update on this. I've committed some changes that seem to have solvedthe problem. It's probably of general interest so I'll provide a bit ofbackground.

It has to do with the optimisation of closures. In general an MLfunction needs to be represented by a closure, an item on the heapconsisting of a pointer to the code and copies of the free variablesneeded by the function. A naive implementation would build a closurefor each function as it was declared but it's possible to do much betterthan that. If we can be sure that a function is only ever called andnot returned or assigned to a reference we don't need to put the closureon the heap. Poly/ML has used a couple of different techniques but morerecently it has handled these cases by building the closure on thestack. A stack closure looks just like a heap closure so the callingconventions are the same. That means it can be used both when we canidentify all the call sites of the function but also if it is passedinto a function such as map or fold. (Actually, List.map andList.foldl/r are small enough that they're inlined but the principle isthe same). The disadvantage is that a call to a function with astack-closure cannot be tail-recursive because the closure needs to stayon the stack until the function returns. This is what went wrong withthis example.

Another solution, which works when all the call sites of a function canbe identified but not when a function is passed as an argument, islambda-lifting. This involves adding the free variables as extraarguments to the function. The resulting function does not need aclosure. I've implemented this for local functions where all the callsites can be found and left the stack closures for the other cases. Itis this change that has fixed the problem since lambda-lifted functionscan be tail-recursive.

It may be that lambda-lifted functions are more efficient for otherreasons. Functions without closures can be code-generated using callsor jumps directly to the function rather than requiring an indirectionthrough the closure. This may fit better with the pre-fetching ofmodern hardware.


David

On 05/02/2015 05:00, Michael Norrish wrote:

(I'll attempt to attach the tgz file to this message, but if 817KB is too big
and this bounces or the attachment is dropped, I'll make it available 
separately.)

If compiled with Poly/ML, the attached program performs abysmally on the
provided testcase.sml input file.  Within seconds of beginning, it attempts to
allocate more memory than my machine has (16GB), and basically brings it to its
knees.  If compiled with mlton, the testcase doesn't seem to cause any obvious
memory allocation, and finishes in less than a second.

You can also compile with Moscow ML, though I haven’t included the build
instructions in the Makefile.  The execution of the mosml executable doesn't
seem to allocate any memory either, and it terminates cleanly (after 17s).
Though slow this is still preferable to Poly/ML's behaviour.

This is with Poly/ML 5.5.2 on

Linux telemachus 3.2.0-75-generic #110-Ubuntu SMP Tue Dec 16 19:11:55 UTC 2014

x86_64 x86_64 x86_64 GNU/Linux

The Makefile generates two executables (if you have mlton and poly in your
PATH): pholdeptool and mholdeptool.  You can run them on the testcase, e.g.:

     pholdeptool testcase.sml

The lexer code looks to me to be tail-recursive, and the program only
accumulates a binary tree containing 15 strings.  So it really does seem as if
Poly/ML is doing something buggy.

(The program is lexing the test-case (which uncompresses to a 30MB file) in an
extremely simple way, but performing a useful analysis.)

Michael



_______________________________________________
polyml mailing list
[email protected]
http://lists.inf.ed.ac.uk/mailman/listinfo/polyml

_______________________________________________
polyml mailing list
[email protected]
http://lists.inf.ed.ac.uk/mailman/listinfo/polyml

Re: [polyml] extremely bad performance on some mllex generated code

Reply via email to