Re: [polyml] Reducing the size of on-disk saved state

Makarius Sat, 19 Mar 2016 08:41:02 -0700

On Mon, 15 Feb 2016, David Matthews wrote:

As far as I'm aware Isabelle just uses the shared state facilities of Poly/ML. That provides the ability to save states that are the direct descendant of the executable or the descendant of another saved state.

I've just changed that to use PolyML.SaveState.saveChild and PolyML.SaveState.loadHierarchy -- see

http://isabelle.in.tum.de/repos/isabelle/rev/0189fe0f6452

From a quick look at the code the main effect that child states have is that StateLoader::LoadFile needs to seek within the saved state file to get the name of the parent file. That has to be loaded before the child because the child may, almost certainly will, overwrite some of the parent data. That may affect how you compact the data. How well do the compression libraries cope with seeking within the file?
From my own point of view I'm concerned that compacting the heap files may add to the cost of loading.
I'd like to see what effect adding compaction has on it but it may be necessary to provide separate functions to save states with and without compaction. Loading is easier because the loader can look at the format. Note that there's no need to provide backwards compatibility because a saved state can only ever be reloaded into the executable that created it.
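
On the question of seeking within a compressed file, here is a minimal sketch in Python, not Poly/ML code. It assumes plain gzip compression and an arbitrary test payload; it says nothing about the actual saved state layout, and the helper names read_at and demo are hypothetical. Python's gzip module does accept seek() on a file opened for reading, but it emulates the seek by decompressing forward from the start of the stream, so a seek to a far offset costs roughly as much as reading everything before it.

```python
import gzip
import os
import tempfile


def read_at(path, offset, size):
    """Read `size` bytes starting at uncompressed `offset` of a gzip file.

    The seek is emulated: gzip decompresses and discards data up to
    `offset`, so the cost grows with the offset.
    """
    with gzip.open(path, "rb") as f:
        f.seek(offset)
        return f.read(size)


def demo():
    """Write a compressed test file, seek into it, and check the bytes."""
    payload = bytes(range(256)) * 4096  # 1 MiB of illustrative data
    fd, path = tempfile.mkstemp(suffix=".gz")
    os.close(fd)
    try:
        with gzip.open(path, "wb") as f:
            f.write(payload)
        offset = 700_000
        chunk = read_at(path, offset, 16)
        return chunk == payload[offset:offset + 16]
    finally:
        os.remove(path)
```

If the loader only needs a small header field such as the parent file name, keeping that field near the start of the stream, or leaving it uncompressed, would avoid most of this cost. That is a design suggestion under the assumptions above, not a description of the existing loader.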

Overall I don't quite see the benefit of spending a lot of run-time and implementation complexity to compress a few GB. Even a small SSD already has 64-128 GB.

What I often see is that a 32-bit poly process is unable to save a heap due to memory shortage. Adding compression in the same process address space would probably make the situation worse.

Note that for the present Isabelle setup, it is trivial to add heap compression outside the poly process, operating on the already saved heap file. What remains is the problem of loading it again without decompressing into a separate file.
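
To illustrate the external approach, here is a minimal sketch in Python, assuming gzip as the compressor. The function names compress_heap and decompress_heap are hypothetical and not part of Isabelle or Poly/ML. Both functions stream the data in chunks, so the whole heap is never held in memory at once, and neither runs inside the poly process.

```python
import gzip
import os
import shutil
import tempfile


def compress_heap(heap_path):
    """Compress an already saved heap file to `heap_path + '.gz'`.

    Streams in chunks via copyfileobj, so memory use stays small
    regardless of heap size. The original file is left in place.
    """
    gz_path = heap_path + ".gz"
    with open(heap_path, "rb") as src, gzip.open(gz_path, "wb") as dst:
        shutil.copyfileobj(src, dst)
    return gz_path


def decompress_heap(gz_path, out_path):
    """Restore the heap into a separate file before loading it.

    This is exactly the extra step mentioned above: the loader still
    needs an uncompressed file on disk.
    """
    with gzip.open(gz_path, "rb") as src, open(out_path, "wb") as dst:
        shutil.copyfileobj(src, dst)
    return out_path
```

The sketch shows the remaining problem directly: decompress_heap has to write a full uncompressed copy, which temporarily needs the disk space that compression was meant to save.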

Anyway, what is the state of more detailed measurements of the application?

It should also be worthwhile to step back a bit, and ask why the heaps are so large. There might be some Isabelle/ML programming mishaps in the application with persistent values that refer to Context.theory instead of Context.theory_id. Standard containers like the Simplifier or Classical context take care of that, but add-ons might not.



        Makarius

_______________________________________________
polyml mailing list
[email protected]
http://lists.inf.ed.ac.uk/mailman/listinfo/polyml
