Re: Rethinking configuration tuples

John Ericson Tue, 05 Sep 2023 12:17:21 -0700

On 8/30/23 22:24, Jacob Bachmeyer wrote:

John Ericson wrote:
Err I mean, is there an example of a *-*-linux-$nongnu-musl?
I would expect that to name an embedded environment using Musl libc and the Linux kernel, but that is not a full system. (Example: may not even have a shell at all)

I suppose except for the system() function in libc, I would consider this a distinction not needed for configs. The choice of what other programs to run (be they init or shell) feels to me not like a build-time / development configuration decision, but a runtime / ops configuration decision. They aren't "viral" decisions in the way that the choice of libc is (since all shared objects that may be combined together need to agree on their deps, most notably libc).

Maybe this could go in the config per the "arbitrary many components, finer distinctions to the right" "converging sequence" approach, but then I would want this further to the right, e.g. aarch64-unknown-musl-noshell not aarch64-unknown-noshell-musl.

The choice of system service management is orthogonal to this, since it has minimal impact on user programs. (Unless systemd gets even more outrageously invasive...)
Agreed, just wanted to double check.
Of course, if systemd *does* get sufficiently outrageously invasive, we might need a *-*-linux-systemd-glibc tuple... (Since systemd gleefully makes extensive use of Linux-kernel-specific features, it cannot possibly be a standard on the GNU system, which supports multiple Free kernels.)

Yes I agree systemd probably can't be "bona fide GNU OS", but I draw the opposite conclusion: this is evidence that the "gnu" for glibc is more important than the "gnu" for "true GNU OS".

Except configure usually does not need a "fully disambiguated" form---the canonical form produced by config.sub is fine, since configure is usually matching against the full tuple using shell case patterns. The flat list with a defined order is optimal for this strategy, since it allows to easily check for the presence of any tag or combination of tags.
Shell case patterns can be a bit of a footgun. For example, a common mistake is doing * instead of *-*.
If the allowed pattern elements are sufficiently unambiguous, there is no mistake, since `*' matches text including `-'. In fact, when testing an "is tag FOO present?" predicate, `*-foo-* | *-foo' would be correct. (I assume that a CPU type will remain required and will remain first in the list.)

Sorry I meant as part of a larger pattern. With things like *-stuff-* vs *-*-stuff-*-*, the extra dashes are needed to make sure "stuff" matches the right component, and even then it only works if one knows the exact number of components (which can be accomplished by *-*... and the ordering of patterns). It is quite subtle!
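To make the two kinds of pattern concrete, here is a minimal sketch in POSIX sh. The function names `has_tag` and `kernel_is` are hypothetical and not part of config.sub or Autoconf, and `kernel_is` assumes a canonical four-component cpu-vendor-kernel-os tuple. It rejects longer tuples with an earlier, longer pattern, which is the "ordering of patterns" trick mentioned above.

```shell
# has_tag TUPLE TAG: succeed if TAG is a whole component of TUPLE,
# other than the first one (the CPU is assumed to come first).
has_tag() {
    case $1 in
        *-"$2"-* | *-"$2") return 0 ;;
        *) return 1 ;;
    esac
}

# kernel_is TUPLE KERNEL: positional match on the third component.
# Only meaningful for exactly four components, so tuples with five
# or more are rejected by the first (longer) pattern.
kernel_is() {
    case $1 in
        *-*-*-*-*) return 1 ;;
        *-*-"$2"-*) return 0 ;;
        *) return 1 ;;
    esac
}

has_tag x86_64-pc-linux-gnu linux && echo "tag present"
kernel_is x86_64-pc-linux-gnu linux && echo "kernel matches"
```

Without the first `*-*-*-*-*` arm, `*-*-linux-*` would also accept a five-component tuple with "linux" in the fourth position, since each `*` may itself match text containing `-`.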

Allow the hypothetical --parse option to accept a PREFIX argument and you are pretty much there:
$ ./config.sub --parse=host x86_64-linux-gnu
host=x86_64-pc-linux-gnu
host_cpu=x86_64
host_vendor=pc
host_kernel=linux
host_os=gnu
$
That form should be both easily parsed by other tools and suitable for `eval` in shell scripts.
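As a rough illustration of the `eval` use, the sketch below defines a stand-in shell function, `parse_tuple`, which is hypothetical since config.sub has no --parse option today. It assumes the tuple is already canonical with exactly four components and does none of config.sub's canonicalization. It also emits values unquoted, which is only safe for the characters that normally appear in tuples.

```shell
# parse_tuple PREFIX TUPLE: hypothetical stand-in for the proposed
# `config.sub --parse=PREFIX TUPLE`. Runs in a subshell so that the
# IFS change and `set --` do not leak into the caller.
parse_tuple() (
    prefix=$1
    tuple=$2
    IFS=-
    set -- $tuple               # split on '-' into $1..$4
    [ $# -eq 4 ] || exit 1      # this sketch handles only four components
    printf '%s=%s\n' \
        "$prefix" "$tuple" \
        "${prefix}_cpu" "$1" \
        "${prefix}_vendor" "$2" \
        "${prefix}_kernel" "$3" \
        "${prefix}_os" "$4"
)

# Load the fields as shell variables, as a configure script might.
eval "$(parse_tuple host x86_64-pc-linux-gnu)"
echo "$host_cpu $host_vendor $host_kernel $host_os"
```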

Yup! We're in agreement.

I agree testing is more robust, but for better or worse I still do see scripts using those host_* variables mentioned above. (Testing is possible but requires more care to get right for cross-compilation, for one.)
In this case the test is `case $host in ... esac`.

I would say it is better to case on (combinations of) `host_*` variables than on `$host`, because then one knows exactly what components are being cased upon; there is no ambiguity. I think one should basically only use `host` as a black-box identifier (e.g. prefixing binaries), and any other time one would like to use `host`, one should use the `host_*` variables instead.
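A small sketch of what casing on the components looks like, assuming the `host_kernel` and `host_os` variables from the --parse output above have been set. The helper `libc_for` and its mapping are hypothetical and purely illustrative.

```shell
# libc_for KERNEL OS: hypothetical helper. Each case statement looks at
# exactly one component, so no dash counting is needed.
libc_for() {
    case $1 in
        linux)
            case $2 in
                gnu*)  echo glibc ;;
                musl*) echo musl ;;
                *)     echo unknown ;;
            esac ;;
        *)  echo unknown ;;
    esac
}

host_kernel=linux
host_os=gnu
libc_for "$host_kernel" "$host_os"
```

Here `host` itself is never inspected; it would only be used opaquely, for example as a prefix for tool names.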

The problem is still getting it /into/ config.sub: config.sub expects a single command-line argument, while pre-parsed form spans a few lines.

I don't think that is so hard. config.sub accepts --gnu-long-args already (without confusing them with configs), so we can simply do something like


./config.sub --pre-categorized cpu=x86_64 vendor=pc kernel=linux os=gnu

and then there is no confusing the two forms of input.
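For illustration only, the key=value arguments could be consumed along these lines. The function `assemble_tuple` is hypothetical, --pre-categorized does not exist in config.sub, and this sketch simply requires all four fields rather than applying any defaults or canonicalization.

```shell
# assemble_tuple KEY=VALUE...: hypothetical handling of the arguments
# that would follow a --pre-categorized flag.
assemble_tuple() {
    cpu= vendor= kernel= os=
    for arg in "$@"; do
        case $arg in
            cpu=*)    cpu=${arg#cpu=} ;;
            vendor=*) vendor=${arg#vendor=} ;;
            kernel=*) kernel=${arg#kernel=} ;;
            os=*)     os=${arg#os=} ;;
            *) echo "unknown field: $arg" >&2; return 1 ;;
        esac
    done
    # This sketch has no defaults, so every field must be given.
    for field in "$cpu" "$vendor" "$kernel" "$os"; do
        [ -n "$field" ] || { echo "missing field" >&2; return 1; }
    done
    echo "$cpu-$vendor-$kernel-$os"
}

assemble_tuple cpu=x86_64 vendor=pc kernel=linux os=gnu
```

Because every argument names its own field, the order of the arguments does not matter and nothing has to be inferred from position.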

[...]
I am not entirely certain why, but I know that there is some reason we call the common GNU/Linux systems *-*-linux-gnu instead of *-*-linux.
To be honest, I think this is basically the "call it GNU/Linux not Linux" controversy --- i.e. at the time it was done for social not technical reasons. I don't mind, since now that we have multiple libcs there /is/ a technical reason to distinguish. But this circles back to my hunch that Kernel (syscall interface) + libc (ABI) determines OS uniquely enough for config.sub's purposes.
That is possible, but still a valid reason for the GNU Project to stay with that angle.

Yeah I have no problem with the term GNU/Linux, I just don't think "OS" is useful for config.sub. "Linux + GNU libc" for config.sub; "GNU/Linux" for humans/prose.

Erm I mean not an extant system that would use such a config under your system, but an extant config (not necessarily a GNU one, could be an LLVM, Rust, or something else one) for such a system. In other words, I am asking whether there was a case where someone else evidently decided that kernel+libc was not enough info and OS was also needed to further disambiguate.
I do not know of any off the top of my head.

OK. For the record, I wouldn't focus on this "OS-libc" stuff so much, except I suspect it would get in the way of the sort of grand reconciliation between us, LLVM, Rust, etc. that needs to happen. If the "OS is actually libc" way it's ended up elsewhere is acceptable to GNU Config, as I hope it can be, and how downstream often uses GNU Config in practice, that gets us much closer to consensus.


John
