Re: RFR: 8210416: [linux] Poor StrictMath performance due to non-optimized compilation

joe darcy Wed, 12 Sep 2018 18:45:14 -0700

Hello,

On 9/12/2018 1:16 AM, Severin Gehwolf wrote:

On Wed, 2018-09-12 at 17:58 +1000, David Holmes wrote:

But I don't understand why the optimization setting is being tied to the
availability of the -ffp-contract flag?

In configure we perform a check for gcc or clang whether that flag is
supported. If it is, it would be non-empty exactly having -ffp-contract
as value. It could be another set of flags for other arches if somebody
wanted to do the same, fwiw. In JDK 8, for example, it's "-mno-fused-
madd -fno-strict-aliasing" for ppc64:
http://hg.openjdk.java.net/jdk8u/jdk8u-dev/jdk/file/2660b127b407/make/lib/CoreLibraries.gmk#l63


We need support for that flag (or a set of flags) when we optimize
fdlibm since otherwise we would lose precision. If the flag is empty
we'd not optimize as we can't guarantee precision. That's why we tie
optimization to the availability of that flag. The expectation is for
this flag to be available on gcc/clang arches only at this point. Does
that make sense?

To condense a potentially long discussion, while the IEEE 754 standardhas long specified particular results for arithmetic operations (+, -,*, /, etc.) on particular floating-point values, languages and theircompilers often do not provide a reliable mapping of language constructsto IEEE 754 operations.

The Java language and JVM are distinctive in this sense because areliable mapping of language-level operation to particular IEEE 754operation is mandated by the JLS. (I will leave aside a complicated butlargely irrelevant discussion of non-strictfp floating-point.)

The C language standards I've looked at do not provide as reliably amapping of floating-point operations as the JLS does. In particular, theC standards generally allow a fused multiply add to be used replace apair of add and multiply instructions in an expression like (a * b + c).The -ffp-contract=off gcc compiler setting disables this and relatedtransformations. (The Sun Studio compilers provide detailedconfiguration options for the sets of floating-point transformationsthat are allowed.)

The specification for StrictMath requires the fdlibm algorithms and thefdlibm algorithms rely on the semantics of the floating-point operationsas written in the source and also rely on some way of doing a bit-wiseconversion between an integral type and double. The latter isaccomplished by interpreting the 64-bits of a double as comprising atwo-element array of 32-bit ints. These idioms often don't work underthe default C compiler options, leading to the long-standing need tohave a separate set of compiler options for FDLIBM. A safe, if slow, setof options is to fully disable optimization for FDLIBM. That is notnecessary if sufficient control over the floating-point and aliasingsemantics is possible via the C compiler options.

In the fullness of time, when (if?) I finish porting the FDLIBM code toJava, these sorts of concerns will no longer apply due to the morereliably mapping of source expressions in Java to IEEE 754floating-point operations.


HTH,

-Joe

Re: RFR: 8210416: [linux] Poor StrictMath performance due to non-optimized compilation

Reply via email to