Re: [dev] CMake

Mathias Bauer Tue, 01 Jun 2010 05:34:34 -0700

Hi Bill,

it took longer than expected, but here we are. I picked up the CMake build prototype that my colleague Martin Hollmichel provided and gave it another whirl. The results didn't change much.


In short:

Of course CMake is able to build OOo, and it is also better than our current build system. But the benefit of using CMake does not appear to be big enough to justify the switch. There are also still some disadvantages in CMake because it is a recursive system, and some open questions. I will try to explain all of that.

It's possible that my summary is caused by my own misunderstanding of the CMake documentation that I found on the net and of what I picked up from lurking on the CMake mailing list for some weeks. So perhaps you can add some new points to that evaluation.

Let me start with the last point in your mail, as it touches a central point and perhaps we can get somewhere by shifting the discussion to the essentials:

Another aspect of CMake that you do not address in your evaluation is
the fact that with CMake you will be "outsourcing" the maintenance of
the build system to the CMake developers.

Sure, that goes without saying. But we also don't plan to take over the maintenance of GNU Make ;-).

It's correct that we don't use "raw" GNU Make; we will use some macros that provide a higher abstraction layer to avoid some common mistakes that we found in our current build system. These macros are what makes this system attractive for us. They provide some important high-level functionality that neither GNU Make nor CMake has out of the box. So we would have to implement the same macros in CMake's macro language if we wanted to have the same features. It's unclear to me whether we can do all of them in CMake, but in the end that probably doesn't matter so much: even if we could, I don't see the benefit of using CMake instead of GNU Make, as we would have to do the same amount of work. But I see a potential disadvantage: we would get an additional layer between our makefile code and the build system that actually does the work.

The makefiles the developers would have to create for the GNU Make based build system would be comparable to the CMake makefiles I have seen in Martin Hollmichel's build, so there's nothing that makes a difference in one or the other direction.

I still think that CMake probably does not provide all the features of GNU Make that we use in our macros. But perhaps it doesn't make sense to discuss single features without knowing the complete story. I assume that one can do *nearly* everything we planned to do with CMake too, or can at least find a replacement that works quite similarly. But if one tries to use all of these things in combination, the result might not be the same.

So my idea to mention some single plus and minus facts in my summary obviously was a mistake, as it could easily be misunderstood and distracted the discussion from the general level to some minor points, even if in total they add to the final result. Without the necessary context of how we are doing things and how we want to do them in the future, the discussion digressed.

So let's focus on the big issues. That should be enough. These are the things that made me add the "nearly everything" in my sentence above.


The first big issue that I have is about dependencies.

Maybe it's me who didn't find the trick, but a forgotten add_dependencies statement gave me a build that didn't break before it started (as it would in our projected build system using the abstracting macros), but somewhere later. And it didn't break at all if by accident the same dependency in another makefile (totally unrelated to the "broken" one) made the build work. This is exactly the unstable behavior we want to get rid of. It will happen less often than in our current build system, but it is still possible.

If a build system knows all dependencies, it can use a correct build order. If it doesn't, it needs help from the developer, who now takes over some of the duties that should belong to the build system. In a recursive build system that means telling the build system: "You need to build Y before you can build X. I don't tell you how to build Y; instead I tell you to call make for makefile M, which will deliver Y, and then you can proceed with building X. Believe me." This makes building Y a side effect from the POV of the build system, and from past experience we dislike that. A build system can do better if provided with the full information. That's where recursive and non-recursive build systems differ.
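
A minimal GNU Make sketch of that difference. All names here (X, x.o, the directory y, y/libY.a) are hypothetical and only serve as illustration; this is not taken from our prototype.

```make
# Sketch only: X, x.o, y/ and y/libY.a are hypothetical names.
# Recipe lines must start with a tab character.

# Recursive style (shown commented out): the top-level makefile only knows
# that running make in directory y "somehow" delivers y/libY.a. The real
# inputs of Y are invisible to this make process.
#
# y/libY.a :
# 	$(MAKE) -C y
#
# X : x.o y/libY.a
# 	$(CC) -o $@ x.o y/libY.a

# Non-recursive style: the rules for Y live in the same make process, so
# make sees the whole chain y/y.c -> y/y.o -> y/libY.a -> X.
X : x.o y/libY.a
	$(CC) -o $@ x.o y/libY.a

y/libY.a : y/y.o
	$(AR) rcs $@ $^

# One pattern rule compiles every object file, including those in y/.
%.o : %.c
	$(CC) $(CFLAGS) -c -o $@ $<
```

In the recursive variant make has to trust that the sub-make delivers Y; in the non-recursive variant a change to y/y.c is enough for make to know that X is out of date.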

Becoming immune to side effects is important for us, and this susceptibility in our current system was the main reason to consider a new build system at all. Without writing the same kind of macros that we have written for GNU Make we wouldn't get that in CMake, as my test has shown. So, as I wrote already, in the end there's no implementation or maintenance benefit from using CMake.

The second issue I have is that CMake still is a recursive system. Though it is better at resolving dependencies than "classical" recursive systems, it still has their other inherent disadvantages. Just to name the two most obvious ones: it does not scale with the number of processes as well as a non-recursive system does (especially in the case of partial builds), and it needs to traverse all source directories just to find out that nothing needs to be built. From our current build system we know that this is a PITA at least on Windows, where it takes between 7 and 20 minutes on standard hardware (quad core CPU), depending on how "hot" the disk cache is. From our GNU Make based prototype we estimate that a non-recursive system for OOo, which does not traverse the file system but includes makefiles, is ~5 times faster on Windows. That's a lot.

Based on lurking on the mailing list and reading the documentation on your web site, I concluded that CMake can only be used recursively. I would be glad if you could prove me wrong and show us that we can use CMake in a non-recursive way too without losing something else, and that we won't run into scalability problems with our huge project. An example of what that could be: we ran some scalability tests for GNU Make before we started using it and discovered that it doesn't scale well above ~25000 rules or so, but that could be fixed easily by using pattern rules as much as possible.
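
To illustrate what "using pattern rules as much as possible" means, here is a small sketch. The source list and file names are hypothetical and not taken from our prototype.

```make
# Sketch only: SRCS and the file names are hypothetical.
# Recipe lines must start with a tab character.
SRCS := a.c b.c c.c
OBJS := $(SRCS:.c=.o)

# Explicit style (shown commented out): one rule per object file, so the
# number of rules grows with the number of source files.
#
# a.o : a.c
# 	$(CC) $(CFLAGS) -c -o $@ $<
# b.o : b.c
# 	$(CC) $(CFLAGS) -c -o $@ $<

# Pattern style: a single rule covers every object file.
%.o : %.c
	$(CC) $(CFLAGS) -c -o $@ $<

all : $(OBJS)
.PHONY : all
```

With the pattern rule the number of rules make has to hold stays constant no matter how many files are listed in SRCS.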

The biggest problem of a switch to CMake besides the ones already
mentioned is that it does not offer a migration path for us.
Switching our build system to CMake would require a one shot
conversion, including parallel maintenance of makefiles for the whole
duration of the switch (that is estimated to last for several
months).


There are ways to do this. In particular, a recent addition to CMake
called external projects might be useful.

Our current build system is a huge Perl program that builds "modules" (sub projects) by calling dmake processes. It evaluates the dependencies between the modules and does a lot more. It's comparable to what CMake does, though CMake does it better.

In our planned non-recursive build system most of the duties of the Perl program would just go away, and its replacement would be a quite simple makefile that just includes all other makefiles. Each of them can be converted from dmake to GNU Make step by step.


Here's what it would look like (the example covers 8 modules):

GBUILDDIR := $(SOLARENV)/gbuild
include $(GBUILDDIR)/gbuild.mk


include $(foreach module,\
        framework \
        sfx2 \
        svl \
        svtools \
        xmloff \
        sw \
        toolkit \
        tools \
,$(SRCDIR)/$(module)/prj/target_module_$(module).mk)

all : $(foreach module,$(gb_Module_ALLMODULES),$(call gb_Module_get_target,$(module)))
	$(call gb_Helper_announce,Completed all modules.)

clean : $(foreach module,$(gb_Module_ALLMODULES),$(call gb_Module_get_clean_target,$(module)))
	$(call gb_Helper_announce,all modules cleaned.)

.DEFAULT_GOAL := all

(Don't look too closely at the somewhat long names; it's still an unpolished prototype.)

The included makefiles could be plugged into this makefile as well as called from the Perl program without changing a single line of code; just a one-line wrapper is needed to call them from the Perl program. And each of these makefiles can be executed standalone.

Migrating to CMake would require us to first convert the Perl program to a CMake process and then add each module as an external project, converting them to CMake later on step by step. This way we would have to reimplement a lot of functionality that just wouldn't be needed in our projected non-recursive system.

So all in all, the migration to CMake would be more work. I meanwhile think, though, that it wouldn't be a one shot conversion as I wrote in my first mail. I got that impression because I only thought about doing it the other way around (calling CMake from the Perl program until all modules are converted).

So far the balance for me is: no difference wrt. maintenance and user makefiles, more work for a conversion to CMake, and some disadvantages due to its recursive nature. Now to some open questions.

Another topic that is mentioned is the fact that builds have to be run
from the top. That is not entirely true. You can have subprojects that
are built on their own. Each add_subdirectory can point to a complete
project. If you cd into the sub directory in the build tree, you can
just run make, and it will only build the targets associated with that
sub-project. Also, you could run cmake on the sub-project by itself if
it is written to work as an independent sub-project.

I never wanted to say that with CMake you always have to run from the top. But it seemed to me that by organizing the build in a way that allows more flexibility I lose other things.

Sure, we can create a CMake makefile for each module we want to be able to build individually and make it its own CMake project. But of course we still want to have the full build over all modules, and we need its result in the same working directory as the build of the individual modules.


The project structure we are aiming at would look this way:

-ooo
--mod1
--mod2
...
--modn

-sun
--sun-mod1
--sun-mod2
...
--sun-modn

We have these two top level projects because we want to build OOo and our commercial variant from the same source tree, as they share most of the code.

We want to go to any sub module and build it independently of the rest, if possible without any external dependencies on other modules. Of course this would require that all other modules "below" had been built at some time before. If you ask why building single modules inside the whole tree is so important: remember the long time it takes on Windows to traverse through modules where nothing has to be built.

We want to go into either of the two top modules (here named ooo and sun) and do a complete build, with all dependencies evaluated correctly, as we would get it for a "normal" project without sub projects.

We want to share the working directory between the two top level projects as we don't want to build the common parts twice, but each of them must be buildable separately, as outside of our Hamburg lab only one top level project exists.


How do I have to set up a CMake build to support that build structure?

Regards,
Mathias

--
Mathias Bauer (mba) - Project Lead OpenOffice.org Writer
OpenOffice.org Engineering at Sun: http://blogs.sun.com/GullFOSS
Please don't reply to "nospamfor...@gmx.de".
I use it for the OOo lists and only rarely read other mails sent to it.