Is there a document or standard (or group of standards) that defines the
collective ABIs of GNU/Linux systems using ELF binary formats across
various CPU architectures, including at least:
IA32/x86-64 (i386/i686/AMD64/EM64T, etc.)
ARM (v5, v5t, v7, etc.)
What is the policy of the GNU toolchain? Does it attempt to support a
superset of features wherever code was contributed, without directly
setting or enforcing policy itself, that being a matter for a
distribution creator to establish?
My concern comes from what look to be non-backwards-compatible changes
being made by the MeeGo distribution of Linux, specifically the
enablement of SSSE3 IA32 instructions for "general purpose code
generation". One possible motive is that a particular hardware vendor
can then claim in marketing that the platform is optimized for whatever
technology/product they are actively trying to sell that day. This
doesn't necessarily make for a good engineering choice.
While I accept that any distribution can do what it wants, given the
choice and resources I might wish to rebuild the entire open source
project around a better set of rules on this matter. I have an interest
in using the software, but as a developer I want the fewest headaches
looking into the future. I also want to be able to provide a complete,
fully native SSSE3-optimized system, while having other systems behave
in a repeatable and consistent way when binaries ultimately end up
shared across them (desktop Linux).
Does GCC support the use of newer CPU instructions for "general purpose
code generation"? If so, in what kinds of situations might they be
selected for use? It is possible I am misinterpreting the situation, if
GCC cannot actually schedule newer instructions during code generation.
So my next question is: what support is there in the various formats,
technologies and runtime libraries to provide a backwards-compatible
solution, such that when a binary from one system is put on another, any
hardware incompatibility is detected at the earliest opportunity, for
example upon execution, soon after execution, or during DSO loading:
1) ELF magic or hwcap (this would allow the kernel to fail the exec()
system call, knowing that the format is not supported by the system);
loaded DSOs would fail the same check. Ideally a hwcap system wouldn't
be a rigid bitmask but some kind of extensible ASN.1-style scheme where
anyone can register their own hierarchical domain and assign whatever
they want within it.
2) Use of a bespoke/custom dynamic linker path. I would guess any system
doing this would be free to implement an alternative ABI; mixing
binaries between systems would result in them not working, for lack of a
dynamic linker at that path.
3) Do the ABIs directly discuss or explain how such matters should be
addressed, i.e. by guarding the execution of new CPU instructions with
runtime checks? This might mean loading whole optimized DSOs instead, or
redirecting a bunch of symbols to optimized code within the same DSO, or
an inline change in the flow of execution. While I understand ABIs may
be open/loose/ambiguous to allow new technologies and ideas to exist,
when known interoperability problems appear, guidelines should exist
that provide technical answers to guide implementers so that good
citizenship may follow.
4) Use of some ELF section to describe additional runtime checking rules.
The problem stems from hit-and-miss users getting SIGILL when unguarded
IA32 instructions are executed on incompatible (older) CPUs. The kernel
doesn't provide any trap-and-emulate, so these general-purpose
applications abort, with possible data loss. Do guidelines exist within
the GNU/Linux ABI on how to be a "good citizen" and help systems
identify incompatible binaries so they simply don't run, instead of
causing a SIGILL potentially months after execution started, because
across the entire executable only a tiny handful of these instructions
were selected by the compiler, and that code path wasn't reached until
long after the executable started?
The next matter: has anyone done any studies on the performance
difference when newer instructions are enabled for "general purpose code
generation"? I'm not so interested in specialized use cases such as
codecs, compression, encryption, graphics, etc.; I consider these
specialized use cases for which many applications and libraries already
have a workable solution, by guarding the execution of the instructions
that optimize such algorithms behind runtime CPU-support checks. I'm
interested in facts on how much benefit regular code gets from this
choice.
Thanks,
Darryl