bug#44462: Problem with get_multilibs on macOS

Fred Wright Mon, 09 Nov 2020 18:45:19 -0800


On Sun, 8 Nov 2020, Jacob Bachmeyer wrote:

Fred Wright wrote:
On Thu, 5 Nov 2020, Jacob Bachmeyer wrote:
Fred Wright wrote:
Even when it's using gcc, it bloats the logfile with the verbose-dumpspecs output, in order to determine something of highly questionablevalue.
That value may be less questionable for some targets, or for tests thatneed to be run across each multilib target a GCC instance supports.
Even in the useful multilib cases, making that behavior unconditional seemslike a bad idea, since the client code may have its own iteration acrossarchitectures, and doing both would either not work or explode themulti-architecture handling from O(N) to O(N^2).
The get_multilibs procedure does not perform iterations itself. I am not yetcertain what exactly it is supposed to do, since it seems to do a largeamount of work to set the "multitop" board_info key. Whatever it does, it isclearly specific to GCC, but does not even check if the selected compilereven looks like GCC.
I traced the results of get_multilibs (as used by the libffi tests) inmany macOS versions, and even the "successful" cases seem to havequestionably useful results:
[...]
The "/usr/." result returned on 10.4, 10.5, and 32-bit 10.6 is the sameas what I see on Ubuntu 14.04, CentOS 7, and Fedora 25, though it's notclear to me what it's supposed to represent.
I have a suspicion that this feature is designed to support testing withan "experimental" compiler build that is not installed on the system andmay be useless with system compilers generally, or with Apple's compilersspecifically, if Apple does not use multilib.
Apple supports the concept of multi-architecture binaries, but not in thesame way that multilib does it (AFAIK). Macs can have "universal"binaries, which are archives combining multiple per-architecture slices.This is applicable to object files, shared libraries, and executables. Ifthe build setup allows it, it can be as simple as including multiplearchitecture options in the compile command. E.g.:
    cc -arch x86_64 -arch i386 -o hello hello.c
Under the hood, the compiler driver runs a separate compile/assemble foreach architecture, and then combines the object files. The linker supportsuniversal binaries directly.
With this arrangement, architecture-related conditionals in the source codework just fine, but what *doesn't* work is having architecture-relatedparameters in a configure script, which is unfortunately not as uncommon asit ought to be.
No, Autoconf pushes programs to respond to features instead of using knownarchitectures. This approach has been very successful: as I understand,most programs using Autoconf needed no changes at all to port to RISC-V evenif they were written long before RISC-V existed. Many programs, if they hadpreviously been ported to any 64-bit architecture, needed no changes at allfor x86_64. Autoconf has achieved something architecture #ifdefs cannot do:provide automatic portability to a new architecture that did not exist whenthe program was written. I consider Autoconf's approach here completelyvindicated.

I'm not talking about checking for specific architectures; this is aboutchecking architecture-related *properties*. For example, a configurecheck for sizeof(long) is incompatible with multi-architecture builds,while using LONG_BIT works just fine. But there are build procedures thatdo things like the former, for no good reason.

You could submit a patch to Autoconf to add support for multi-arch config.hfiles, where configure would run tests for each of a list of architecturesand use arch #ifdefs in config.h to select the configure results for eachcompile.


I don't plan to mess with Autoconf if I can possibly avoid it. :-)

Since get_multilibs already has code to return an empty string in the"remote" case (where it assumes this function won't work), I just addedcode to unix.exp to set multitop to "" for all "darwin" targets, therebyshort-circuiting almost all og get_multilibs. That certainly fixes theproblem with the libffi tests, and doesn't change any non-Mac behavior,though I don't know if that's the ideal fix. The whole get_mutilibsfunction looks pretty ugly anyway, and it's generally recognized thatrelying on -dumpspecs is a bad idea.
It is most certainly not ideal. A better solution is probably to add atest to get_multilibs to return an empty string if the compiler is notGCC. Of course, if another compiler pretends to be GCC enough to passthat check, but does not actually implement -dumpspecs, that is not ourbug.
Limiting it to gcc would avoid actual failures, but wouldn't avoid bloatingthe logfile with the humongous -dumpspecs output in the many cases wherethe multilibs action isn't even wanted.
The meaning of "pretends to be gcc" isn't well-defined. It's not uncommonto have a compiler named "gcc" which is really clang, largely because thereare so many projects that think that all compilers of interest are named"gcc". And of course, clang tries to be highly gcc-compatible, tofacilitate switching to it, but not to the extent of implementing-dumpspecs, which is is derived purely from gcc's internal implementationdetails, and was never intended to be used in this fashion.
Autoconf has always allowed setting CC to select a compiler. Apple *could*have shipped Clang as "llcc" or similar or even simply the traditional "cc"for a system compiler, but they chose to ship it as "gcc" instead. Not thatthe current version of get_multilibs even bothers to check that the compilerhas "gcc" in its name...
The libffi test suite comes up with a "compiler_vendor" variable whichseems to be able to distinguish clang from gcc, though I haven't looked atthe details.
Fixing get_multilibs properly would probably mean making it both highlyplatform-specific and optional.
The get_multilibs procedure is *not* platform-specific; it is GCC-specific.I am still unsure how exactly it is used.

Since it seems to involve file paths, it may be specific to combinationsof GCC and platforms.

But since neither of us seems to know very much about what get_multilibsis trying to do, it's hard to discuss it intelligently. :-)

This issue on Mac OS X will probably be a known bug in 1.6.3 and fixed in1.6.4.
I primarily tested my patch against the 1.6.2 release, since the currentmaster won't install from a non-git directory, and also has multiplefailures in its own tests (even on Linux). The patch is nearly identicalbetween the 1.6.2 and master cases, anyway.
Are we looking at the same current master? I have commit3d62df24deedfb3c7c3e396a31b8ce431138eb49 here and all of the tests pass.
****These other problems are potential release blockers for 1.6.3.****
Can you file another bug report with the test failures and moreinformation about these issues?
I looked into this more closely and it's probably related to the non-gitissue. When running from a non-git directory, the configure script reportsa "fatal" error, but then goes on to complete with a zero exit status and amore or less buildable setup, so you have to be paying close attention tothe output to notice.

Actually it now looks like the two things are unrelated; I filed twoseparate bugs.

If this is a typical hack to provide git-based extra information inbetween-release version strings, it should have a fallback for the non-gitcase. Consider the case of pushing all the git-tracked files to a testsystem with git ls-files and rsync.
Please file another bug report for this issue. This is separate fromget_multilibs mishandling non-GCC compilers.
I can send the current patch, either as a bare email or as an attachment.AFAIK, Savannah doesn't have the pull request / merge request concept.
This will need to be fixed in libgloss.exp, not unix.exp. I am putting myfoot down on fixing bugs in DejaGnu's own tree directly instead of hackingaround them like that.
Well, OK, but there seem to be other similar hacks in unix.exp, and if theidea is that get_multilibs is completely useless on the Mac (which appearsto be the case with the current implemenation, anyway), then disabling itin the target-related code doesn't seem unreasonable.
It is not completely useless even on MacOS X -- a user could install GCC andexpect a testsuite to use it, particularly in the case of a cross-compilerfor embedded development.

If it's a cross compiler, then by definition it can't run the resultingcode on the host platform. But since get_multilibs already excludes allremote cases, it wouldn't be able to run it on a separate target platform,either.

Besides, aren't files like baseboards/unix.exp based on the *target*platform, not the host? If so, then it seems like disabling get_multilibsfor the Mac there is exactly the right thing, at least until such time asget_multilibs can behave usefully for a Mac target.


Fred Wright



_______________________________________________
Bug-dejagnu mailing list
Bug-dejagnu@gnu.org
https://lists.gnu.org/mailman/listinfo/bug-dejagnu

bug#44462: Problem with get_multilibs on macOS

Reply via email to