Re: [polyml] Update to code-generator and run-time system interface

David Matthews Tue, 18 Oct 2016 07:24:54 -0700

On 18/10/2016 13:43, Makarius wrote:

On 17/10/16 23:58, David Matthews wrote:


Although the lack of garbage collection of code would mean that
repeatedly defining the same function would be a memory leak I would be
surprised if it was a serious problem.  Is it likely that one would
repeatedly redefine the same function within a particular session?


This happens all the time in IDE interaction: things are compiled,
edited, re-compiled; thus old things become inaccessible.

You introduced that principle yourself many years ago, by providing the
very nice PolyML.Compiler interface.

That is one of the big assets of Poly/ML and consequently of Isabelle/ML.

Yes, but the actual memory needed for the function code is not going to be large compared with the total heap size. We have heaps of the order of gigabytes but the whole of Isabelle is just tens of megabytes.

Perhaps I should explain why I made this change and what could be done to mitigate the effects. There were two reasons. The first was to simplify the code and avoid the contortions that were necessary and the second was for security and long-term stability.

For the garbage collector to be able to compact the heap it has to be able to find and modify all the addresses of heap cells. To do that, values are distinguished by a tag bit. If the bottom bit is set the value is an integer and is ignored by the GC. Other values are addresses. An address always points to the start of a cell, so it will be word aligned, either XXXX00 (32-bit) or XXX000 (64-bit) in the bottom bits. Before the start of a cell is a word containing the length of the cell and some bits that indicate whether the cell is a tuple/vector containing values (i.e. either tagged integers or addresses) or is byte data, typically a string.
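
The tagging scheme described above can be sketched as follows. This is only an illustration, not the Poly/ML run-time code: the 64-bit word size, the function names and the "shift left and set the bottom bit" integer encoding are assumptions made for the example.

```python
WORD_BYTES = 8  # assumption: 64-bit words, so cell addresses end in binary 000


def tag_int(n):
    """Encode an integer as a tagged value (assumed encoding: shift left, set bottom bit)."""
    return (n << 1) | 1


def untag_int(word):
    """Recover the integer from a tagged value."""
    return word >> 1


def classify(word):
    """Classify a value the way a compacting GC would need to."""
    if word & 1:
        return "integer"   # bottom bit set: ignored by the GC
    if word % WORD_BYTES == 0:
        return "address"   # word aligned: points at the start of a cell
    return "other"         # even but not word aligned: neither of the two


print(classify(tag_int(21)), classify(0x1000), classify(0x1002))
```

The third case is never produced by ordinary values under this scheme, which is what leaves room for marking some other kind of value with a distinct bit pattern.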

This is extended to cope with cells containing machine code. This is just another type of cell. A code cell is not quite byte data because it can contain addresses if there are values that are compile-time constants.

Provided we're dealing with the entry point addresses of code cells this all works fine. The complication comes when the code is actually executed. If one function calls another, or even itself recursively, it uses the X86 CALL instruction. It is important to use this instruction and not try to do the function call any other way because among other things the prefetching hardware recognises CALL/RET pairs. The CALL instruction pushes the return address, the address of the next instruction, to the stack.

This, though, causes problems for the GC. Return addresses are inherently addresses into the middle of cells. They are also on an arbitrary alignment since X86 instructions are not aligned in any way. For the GC to be able to compact the heap it has to be able to find and update the return addresses. If Poly/ML used "stack frames" it might be possible to find return addresses using the frame pointer register, but using frame pointers requires an extra register and increases the cost of every function call. Instead the code-generator added no-op instructions before each CALL such that the return address after the instruction was on a word+2 byte alignment, i.e. XXX10 in the bottom bits. This is neither an integer nor a word address so the GC can recognise these as return addresses. There is still the problem of finding the actual start of the code cell and to do this there is a zero word at the end of each code cell. That requires that the code-generator never generates a full word of zeros anywhere else in the code, or at least not on a word boundary, so there are a few constraints on the code to ensure that is the case.
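
As a rough illustration of the old scheme, the sketch below computes how many no-op bytes are needed before a CALL, tests whether a stack value has the return-address bit pattern, and scans forward from a return address to the zero end-marker word. The 32-bit word size, the function names and the model of memory as a list of words starting at address 0 are all assumptions for the example. How the start of the cell is then derived from the end marker is not described in this message, so the sketch stops at the marker.

```python
WORD = 4  # assumption: 32-bit words, so a return address ends in binary 10


def nop_padding(call_addr, call_len):
    """Number of one-byte no-ops to emit before a CALL of call_len bytes at
    call_addr so that the return address falls on a word+2 boundary."""
    return (2 - (call_addr + call_len)) % WORD


def is_return_address(value):
    """True if the value is neither a tagged integer nor a word-aligned address."""
    return value % WORD == 2


def find_end_marker(words, ret_addr):
    """Scan word by word from the first word boundary after ret_addr and
    return the byte address of the first zero word (the end-of-cell marker)."""
    i = (ret_addr + WORD - 1) // WORD
    while words[i] != 0:
        i += 1
    return i * WORD


pad = nop_padding(0x1000, 5)       # a 5-byte CALL placed at 0x1000
print(pad, hex(0x1000 + pad + 5))  # padding and resulting return address
```

The scan only works because no other word-aligned word in the code is zero, which is the constraint on the code-generator mentioned above.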

Removing the code from the normal heap and using a separate, non-garbage collected area means that these contortions are no longer needed. The length word is retained since it is needed when copying the code to an object file or saved state and when loading from a saved state.

The other reason for making the change is that having the code cells in the normal heap requires the heap to be given read, write and execute permissions. This is a problem for security and I was concerned that a future operating system update might ban the use of both write and execute permissions on the same area of memory. I think this is already an issue with SELinux. Using a separate "code" area avoids this. Although the code area needs to be writable to add new code cells to it there are tricks that can be used to get round this.

It might be possible to use a non-compacting GC on the code area, i.e. to mark code cells that are no longer reachable and then reuse the space. That can lead to fragmentation but would reduce the memory leak. In almost all cases a code cell that is reachable will have at least one address in the heap that points at the start. It is, though, possible to construct pathological cases where the only reference is through a return address. We're not going to change the return address since we're not compacting so it might be possible to get round that by assuming that any value on the stack that looks like it might be a return address is a sufficient reason to keep the code cell, even if it is actually an integer.
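
A minimal sketch of that conservative marking idea follows. It is not a proposed implementation: the representation of code cells as (start, length) pairs, the function name and the inputs are hypothetical, and "looks like a return address" is simplified to "falls anywhere inside a code cell". A real marker would also have to follow the constants held inside live code cells.

```python
from bisect import bisect_right


def mark_code_cells(cells, heap_refs, stack_values):
    """cells: list of (start, length) pairs sorted by start address.
    heap_refs: addresses found in the heap (they point at cell starts).
    stack_values: raw stack words, treated conservatively.
    Returns the set of start addresses of cells that must be kept."""
    starts = [start for start, _ in cells]
    start_set = set(starts)
    live = {a for a in heap_refs if a in start_set}
    for v in stack_values:
        i = bisect_right(starts, v) - 1   # last cell starting at or before v
        if i >= 0 and v < starts[i] + cells[i][1]:
            live.add(starts[i])           # keep it even if v is really an integer
    return live


cells = [(0x1000, 0x40), (0x2000, 0x80), (0x3000, 0x20)]
live = mark_code_cells(cells, heap_refs=[0x1000], stack_values=[0x2012, 0x5000])
reusable = [c for c in cells if c[0] not in live]
print(sorted(hex(a) for a in live), reusable)
```

Because nothing is moved, a false positive only costs the space of one code cell that could otherwise have been reused.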

Sorry that's been rather long but I wanted to put on record the thinking behind the change and maybe get some other ideas.


David
_______________________________________________
polyml mailing list
[email protected]
http://lists.inf.ed.ac.uk/mailman/listinfo/polyml
