On Friday, 15 September 2023 at 20:22:50 UTC, Atila Neves wrote:
> An argument could be made that it could/should install the dependencies such that only one `-I` flag is needed.

Indeed, this would be god tier.

> ~190k SLOC (not counting the many dub dependencies) killed dmd on a system with 64GB RAM + 64GB swap after over a minute. Even if it worked, it'd be much, much slower.

What you do with the lines of code is *far* more important than how many there are.

The arsd library has about 219,000 lines of text if you delete the Windows-only and obsolete modules (which I did just so I can actually `dmd *.d` here on my Linux box). That includes comments and such; dscanner --sloc reports about 98,000.

$ wc *.d
<snip>
 218983  870208 7134770 total
$ dscanner --sloc *.d
<snip>
total:  98645

Let's compile it all:

$ /usr/bin/time dmd *.d -L-L/usr/local/pgsql/lib -unittest -L-lX11
5.35user 0.72system 0:06.08elapsed 99%CPU (0avgtext+0avgdata 1852460maxresident)k
0inputs+70464outputs (0major+536358minor)pagefaults 0swaps

That's a little bit slow, over 5 seconds. About 1.3 of those seconds are spent in the linker; the other 4 belong to dmd itself (dmd -c). It also used almost 2 GB of RAM, more than it probably should, but it worked fine.

My computer btw is a budget model circa 2016. Nothing extraordinary about its hardware.

But notice it isn't actually running out of RAM or melting the CPU over a period of minutes, despite being six figures of lines of code by any measure.


On the other hand, compile:

// 20 billion appends, forced to run at compile time (CTFE)
enum a = () {
   string s;
   foreach(i; 0 .. 20_000_000_000)
     s ~= 'a';
   return s;
}();


Don't actually do it, but you can imagine what will happen: six lines that can spin your CPU and explode your memory. Indeed, even just importing this module will cause the same problem, even if the build system tries not to compile it again.

The arsd libs are written - for the most part, there are some exceptions - with compile speed in mind. If I see my build slow down, I investigate why. Most problems like this can be fixed!

In fact, let's take that snippet and talk about it. I had to remove *several* zeroes to make it even work without freezing up my computer: with a 100,000 item loop, it just barely worked. Even 200,000 made it OOM.

But ok, a 100,000 item append:

0.53user 1.52system 0:02.17elapsed 95%CPU (0avgtext+0avgdata 4912656maxresident)k

About 5 GB of RAM devoured by these few lines, taking 2 seconds to run. What are some ways we can fix this? The `~=` operator is actually *awful* in CTFE: its behavior is quadratic (...or worse; I didn't confirm this today, but it is obviously bad, presumably because each append copies the string built so far). So you can fix this pretty easily:

enum string a = () {
   // preallocate the buffer instead of append
   char[] s = new char[](100000);
   foreach(ref ch; s)
     ch = 'a';
   return s;
}();

0.17user 0.03system 0:00.21elapsed 98%CPU (0avgtext+0avgdata 45748maxresident)k 16inputs+1408outputs (0major+21995minor)pagefaults 0swaps

Over 10x faster to compile, 1/100th of the RAM, same result. Real world code is frequently doing more than this example, and rewriting it to work like this might take some real effort... but the results are worth it.
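For instance, here's a minimal sketch of the measure-then-fill shape that rewrite usually takes (the fragments being joined are invented for illustration):

enum string generated = () {
    // hypothetical pieces we want joined at compile time
    string[3] parts = ["foo", "bar", "baz"];

    // pass 1: measure the total length
    size_t len = 0;
    foreach(p; parts)
        len += p.length + 1; // +1 for the separator

    // pass 2: preallocate once, then fill in place
    char[] buf = new char[](len);
    size_t pos = 0;
    foreach(p; parts) {
        buf[pos .. pos + p.length] = p[];
        pos += p.length;
        buf[pos++] = '\n';
    }
    return buf;
}();

Two passes over the data, zero reallocations, instead of one append per piece.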

And btw try this: import this module and check your time/memory stats. Even if the module itself isn't recompiled, since CTFE runs when the module is even just imported, you gain *nothing* from separate compilation!
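A minimal sketch of that experiment (file names invented): put the slow enum above in a module called slow, then compile a do-nothing importer and watch the stats.

// main.d: does nothing except import the module with the big enum
import slow;

void main() {}

Compiling main.d alone still pays the whole CTFE bill, because the enum initializer gets evaluated while the import is analyzed.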

...but there are times when you can gain a LOT from separate compilation in situations like this, if you can move the CTFE into some private thing not exposed in the interface. This requires some work by the lib author too, though, in most cases. An example where you can gain a lot is when something does a lot of internal code generation but exposes a small interface, for example a scripting language wrapper. (Though script wrappers can also be made to compile reasonably efficiently if you:

* preallocate buffers;
* keep your generated functions short - again, the codegen has quadratic behavior, so many small functions work better than one big one - and factor the code well so the generated code is minimal and calls back into generic things, e.g. type erasure (see the sketch below);
* collapse template instances;
* keep CTFE things CTFE-only, using a variety of techniques, so they are not codegened unless they are actually necessary.)
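To illustrate the type erasure point, a hedged sketch (every name here is invented): instead of generating a big marshalling body per wrapped function, the codegen emits only a tiny shim per function, and the fat logic lives in one hand-written, generic place.

import std.variant : Variant;

alias Erased = Variant function(Variant[] args);

// the shared, generic logic is written once by hand, not generated:
// arity checks, error reporting, conversions, etc. would live here
Variant dispatch(Erased fn, Variant[] args) {
    return fn(args);
}

// per wrapped function, the codegen only needs to emit a short shim
Variant add_shim(Variant[] args) {
    return Variant(args[0].get!int + args[1].get!int);
}

void main() {
    assert(dispatch(&add_shim, [Variant(2), Variant(3)]).get!int == 5);
}

Each generated function stays a few lines long, which keeps the quadratic codegen behavior from ever getting a big n to chew on.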

My arsd.script and arsd.cgi can wrap large numbers of functions and classes reasonably well, but that's why programs using them tend to be multi-second builds... just note that's *programs using them*. Separately compiling the libraries doesn't help. You'd have to structure the code to keep those codegen parts internal to a package with a minimal interface; then separately compiling those internal components might win (a sketch of that structure follows).
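Here's a minimal sketch of that structure, assuming a hand-written .di interface file (module and symbol names invented):

// biginternal.d: compiled once, separately; the expensive CTFE
// stays private to this file
module biginternal;

private enum string bigTable = () {
    char[] s = new char[](100_000);
    foreach(ref ch; s)
        ch = 'a';
    return s;
}();

string lookupTable() { return bigTable; }

// biginternal.di: the hand-written interface importers actually see;
// it declares the function but omits the enum initializer, so
// importing it triggers no CTFE at all
module biginternal;

string lookupTable();

Importers link against the separately compiled biginternal.o and never pay for the table's construction in their own builds.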

But this is a fairly niche case. Yes, I know there's one major commercial D user who does exactly this. But that's the exception, not the rule.

