D's memory-hungry templates

tsbockman via Digitalmars-d Thu, 09 Jun 2016 07:51:18 -0700

While working on a small PR(https://github.com/dlang/phobos/pull/4420), I noticed that D'stemplate computation system has horrific memory consumption (aswell as being very slow).


I believe there are several reasons for this:

1) All template instantiations are memoized, even if they're justprivate internal implementation details. This causes the spacecomplexity for recursive algorithms to generally be just as badas the time complexity, whereas it should usually be much better.

2) Everything is immutable, which sometimes forces O(N) arrayalgorithms to be replaced with O(N log(N)) tree algorithms. Thiscompounds with (1).

3) Even though D *requires* that all template algorithms berecursive, the recursion limit is actually set very low (500).This forces efficient linear recursion O(N) algorithms to bereplaced with wasteful O(N log(N)) binary recursion, unless N isguaranteed to be very small. This compounds with (1), andsometimes (2).

The combination of these issues causes very severe memoryconsumption and speed problems; as an example`staticSort!(aliasSeq!(iota(N)))`, which should have a timecomplexity of O(N log(N)) and a space complexity of O(N), insteadseems to have a complexity of O(N^2 log(N)) for both time andspace. This is awful - worse than any normal sorting algorithm,despite the fact that `staticSort` is based on thenormally-very-efficient "merge sort" algorithm.

On my system, for N = 450 the sort takes about 300 ms andconsumes 120 MB of memory. (With my pull request this is reducedto 80 ms and 35 MB, but that's still terrible.)

Ultimately, I believe it was a mistake for D to implement aseparate, inferior programming language just for templates.However, it is too late to change that now (at least for D2), soI will offer some suggestions as to how memory consumption can bereduced within the current design:

A) Members of a template instantiation should be eagerlyevaluated. Once a template has been fully evaluated, any privatemember which is not referenced by a public one should be deleted.

B) Template instantiations should be deleted after they are nolonger accessible (even indirectly) via a top-level declaration.

C) The compiler should store `T...` in such a way thatrecursively appending N items to the beginning or end of anAliasSeq does not allocate more than O(N log(N)) additionalmemory. (O(N) is possible with more indirections.)

D) Implement tail recursion optimization for templates. Tailrecursion should not count toward the recursion depth limit.

E) Consider eliminating the recursion limit entirely. Given thatthe template system is Turing Complete and mandates heavy use ofrecursion, there is no reason to think that a depth of 500 meanssomething has gone wrong, any more than for a `while` looprunning 500+ iterations. (Implementing this may require fixingthe exponential name growth, though.)

Implementing A, B, and C should get D's template memoryconsumption under control.Implementing D and E will make template computation moreflexible, encouraging people to use it more and find new thingsto complain about. :-P


Thoughts?

D's memory-hungry templates

Reply via email to