On Friday, 3 April 2015 at 17:55:00 UTC, Dicebot wrote:
On Friday, 3 April 2015 at 17:25:51 UTC, Ben Boeckel wrote:
On Fri, Apr 03, 2015 at 17:10:31 +0000, Dicebot via Digitalmars-d-announce wrote:
On Friday, 3 April 2015 at 17:03:35 UTC, Atila Neves wrote:
* Separate compilation. One file changes, only one file gets rebuilt.

This immediately caught my eye as a huge "no" in the description. We must ban C-style separate compilation; there is simply no way to move forward otherwise. At the very least, we should not endorse it in any way.

Why? Other than the -fversion=... stuff, what is really blocking this? I personally find unity builds to not be worth it, but I don't see anything blocking separate compilation for D if dependencies are set up
properly.

--Ben

There are 2 big problems with C-style separate compilation:

1)

It complicates whole-program optimization. Old-school object files simply don't preserve enough information to produce optimized builds, and we are in no position to create our own metadata + linker combo to circumvent that. The same applies to attribute inference, which has become a really important development direction for taming the growing attribute hell.
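
As a tiny, hypothetical illustration of why inference needs more than object files: templates already get attribute inference today precisely because their bodies travel with the import; extending similar inference to ordinary functions is exactly what a plain object file cannot support, since the caller's compilation never sees the body.

    // square is a template, so pure/nothrow/@safe/@nogc are inferred from
    // its body -- possible only because the full source is visible to the
    // importing compilation, not just a declaration plus an object file.
    T square(T)(T x) { return x * x; }

    void main() @safe nothrow @nogc
    {
        auto n = square(21); // OK: inferred attributes satisfy main's constraints
        assert(n == 441);
    }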

During the last D Berlin Meetup we had an interesting conversation about attribute inference with Martin Nowak, and dropping legacy C-style separate compilation seemed to be recognized as unavoidable if anything decent is to be implemented in that domain.

2)

Ironically, it is just very slow. Those who come from the C world are used to relying on separate compilation to speed up rebuilds, but it doesn't work that way in D. It may look better if you change only one or two modules, but as the number of modified modules grows, an incremental rebuild quickly becomes _slower_ than a full program build with all files processed in one go. It can sometimes result in an order-of-magnitude slowdown (personal experience).

The difference from C is that repeated imports are very cheap in D (you don't copy-paste module contents again and again like with headers), but at the same time semantic analysis of an imported module is more expensive (because D semantics are more complicated). When you do separate compilation you discard the already processed imports and repeat that analysis from the very beginning for each newly compiled file, accumulating a huge slowdown for the application in total.
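
As a rough sketch with made-up file names: compiling each file separately forces the shared imports to be re-analysed on every compiler invocation, whereas passing the files to dmd together analyses them once.

    # C-style separate compilation: shared imports re-analysed per invocation
    dmd -c a.d
    dmd -c b.d
    dmd -c c.d
    dmd a.o b.o c.o -ofapp

    # all-at-once build: shared imports analysed a single time
    dmd a.d b.d c.d -ofapp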

To get the best compilation speed in D you want to process as many modules with shared imports in one go as possible. At the same time, for really big projects this stops being feasible at some point, especially if CTFE is heavily used and memory consumption explodes. In that case the best approach is partial separate compilation: decoupling parts of the program into static libraries and compiling the libraries in parallel, but still compiling each library in one go. That gives you parallelization without doing the same costly work again and again. A minimal sketch of that layout follows.
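
Assuming hypothetical foo/ and bar/ source trees: each library is built in one go with -lib, the two library builds can run in parallel, and only the final step compiles the application and links everything together.

    # each static library compiled in one go (these can run in parallel)
    dmd -lib foo/*.d -oflibfoo.a
    dmd -lib bar/*.d -oflibbar.a

    # compile the application and link against the prebuilt libraries
    dmd app.d libfoo.a libbar.a -ofapp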

Interesting.

It's true that it's not always faster to compile each module separately; I already knew that. It seems to me, however, that when that's the case, the practical difference is negligible. Even if compilation is 10x slower, the linker will take longer anyway, because it'll all still be under a second. That's been my experience, anyway: it's either faster or it doesn't make much of a difference.

All I know is I've seen a definite improvement in my edit-compile-unittest cycle by compiling modules separately.

How would the decoupling happen? Is the user supposed to partition the binary into suitable static libraries? Or is the system supposed to be smart enough to figure that out?

Atila

