Re: prebuilt libraries?

Gregory Szorc Wed, 26 Nov 2014 08:49:41 -0800

On 11/25/14 10:50 PM, Andreas Gal wrote:


Would it make sense to check in some of the libraries we build that we very 
rarely change, and that don’t have a lot of configure dependencies people 
twiddle with? (icu, pixman, cairo, vp8, vp9). This could speed up build times 
in our infrastructure and for developers. This doesn’t have to be in 
mozilla-central. mach could pick up a matching binary for the current 
configuration from github or similar. Has anyone looked into this?

Let me rephrase this request: you are asking for a cache of binaryartifacts for the build.


Yes, this is critically important for developer productivity.

Yes, it has been looked at extensively.

Yes, it is achievable.

But, it is a lot of work and the historical low engineering investmentin the build system has prevented this from coming to fruition thus far.


Some background.

There are 2 ways you can build your cache: high-level or low-level.

In the low-level approach, you effectively have a globally distributedccache. This is what Mike Hommey has built in sscache. It's what releaseautomation uses. It even works on Windows. The low-level approachperforms per-object lookup when the build system is ready to producethat object. Since the build system e.g. produces .o files then .sofiles, you must first obtain cached values for the intermediate objects,then you move on to the final objects. This is the nature of a low-levelcache.

In the high-level approach, you recognize what the final output is andjump straight to fetching that. e.g. if all you really need is libxul,you'll fetch libxul.so. None of this intermediary .o files foo.


Different audiences benefit from the different approaches.

Firefox desktop, Fennec, and FxOS developers benefit mostly from ahigh-level approach, as they don't normally care about changing C++.They can jump straight to the end without paying a penalty of dealingwith intermediaries.

Gecko/C++ developers care about the low-level approach, as they'll bechanging C++ things that invalidate the final output, so they'll befetching intermediate objects out of necessity.


Implementing an effective cache either way relies on several factors:

* For a high-level cache, a build system capable of skippingintermediates to fetch the final entity (notably *not* make).* Consistent build environments across release automation and developermachines (otherwise the binaries are different and you sacrifice cachehit rate or "accuracy").* People having fast internet connections to the cache (round tripsdon't take longer than building locally).* Fixing C++ header dependency hell so when C++ developers changesomething locally, it doesn't invalidate the world, causing excessivecache misses and local computation.* Writing to a globally distributed cache that is also read by releaseautomation has some fun security challenges.* Having a database to correlate source tree state with build artifacts*or* a build system that is able to compute the equivalent DAG toformulate a cache key (something we can't do today).

There is a lot buried in that bullet list. These problems aren't goingto solve themselves overnight.

A low-level cache is achievable. We already have one in ccache andsccache. (ccache is local, sccache is distributed in S3). However, wewon't be able to get cache hits from sccache until we reproduce therelease automation build environment on local machines. Deterministic,bit-identical builds, anyone? Fortunately, Morgan Phillips is working onmaking release automation's build environment distributable, unblockingthis aspect.

The high-level cache requires separate things. Modern build systems haveartifact caches built in. They can jump straight to the end result andskip intermediaries. We can't have nice things with the 30+ year oldtool that is GNU Make. Well, we could, it just require us to invent abuild mode that short-circuits compilation, linking, etc and manuallyfetches the final object from the cache. IMO we should build this forFirefox and Firefox OS developers. We kinda/sorta already have this inxulrunner and parts of Firefox OS builds. We know the approach works. Wejust need to make it turnkey and better integrated with release automation.

I'd like to see us invest in both high-level and low-level approaches,as I believe both audiences are large enough to warrant targetedinvestment. Historically, we've leaned heavily towards low-level.

_______________________________________________
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform

Re: prebuilt libraries?

Reply via email to