Re: DIP10005: Dependency-Carrying Declarations is now available for community feedback

Andrei Alexandrescu via Digitalmars-d Sat, 17 Dec 2016 16:35:58 -0800

On 12/17/2016 02:34 PM, Chris Wright wrote:

Just looking at this again:

The obvious workaround to the problem that dependencies must be module-
level is to simply define many small modules---in the extreme, one per
declaration.


Andrei works in phobos a lot. Phobos has a lot of large modules. For
instance, std.datetime is 35,000 lines. It's not unusual for a phobos
module to have over 6,000 lines (std.math, std.typecons, std.traits,
std.format, std.conv).

Let's take a look at that hypothesis. The example I chose randomly (andwhich turned to be a rat's nest of fuzzy dependencies) was std/array.d,clocking at 3585 lines. Then looking at the entire project:

wc -l std/*.dstd/{algorithm,container,digest,experimental,internal,net,range,regex}/**/*.d| sort --key=1 -n | cat -n

This outputs the modules in the standard library (excluding those thatare simple header translations), sorted by LoC, numbered. See result inhttp://paste.ofcode.org/Lc5xfcs8GqpT2cabApSSgk. That shows 137 modules,median length 903, average length 2055 --- including full documentation,unittests, and examples. These numbers seem quite reasonable and ifanything compare favorably against other projects I've been on.

I'd normally recommend breaking up modules at one fifth that size.

Yeah, std/datetime.d is a monster, from what I can tell owing to a roteand redundant way of handling unittesting. I didn't look at itsdependencies, but I doubt they are special. I was quite vocal aboutbreaking it up, but I got mellower with time since (a) someone measuredits size without unittests and it was something like one order ofmagnitude smaller, and (b) there was really no more trouble using ormaintaining it than with anything else in Phobos.

I should also add that each large project has a couple of outliers likethat. I even recall a switch of a couple thousand lines once :o).

The
standard library benefits from low granularity modules. It needs to
implement a variety of related tools for working with particular things.

For the hunting-for-definitions case, you also need:

* a module with more than a few imports, from different libraries or
packages
* ambiguous names, or functions that are widely used
* the user can't use an IDE / ctags / dcd
* the user can't use ddox / dpldocs.info, which turns type references
into links; or the user is using that and needs to find the definition of
a template constraint
* the maintainer cannot use selective imports
* the maintainer cannot break the module up to reduce the number of
dependencies
* the maintainer is willing to spend the effort to convert top-level
imports into tightly scoped imports

For the compilation-speed case, you need:

* large dependencies that this allows you to skip (the module combines
several types of functionality with different dependencies)
* the imported module must be in another compilation unit (incremental
compilation or a separate library)
* the dependencies can't be used by any other module in the compilation
unit
* no selective imports
* the module being compiled depends on something in the same scope

That's a pretty marginal use case.

Most of these have been the case with all C++ and D projects I've beeninvolved with at Facebook.

Please let me know what of this information I should include in the DIPto make it better. Thanks.



Andrei

Re: DIP10005: Dependency-Carrying Declarations is now available for community feedback

Reply via email to