Re: Very high cost of ieee_next_after function

Damian McGuckin Sun, 10 Aug 2025 18:08:46 -0700


Hi Bastiaan,

On Sun, 10 Aug 2025, Bastiaan Braams wrote:

The third loop in the appended code iterates over the 32-bit integersand at each iteration a simple arithmetic operation is performed (thatcannot be optimized away). The fourth loop iterates over the 32-bitreals from -huge(0.0_real32) to huge(0.0_real32) by way of theieee_next_after function. The timings are reported and again we observethe factor of about 200 difference. Compiling with `gfortran -O5` I get3.9 seconds for the third loop and 883 seconds for the fourth loop on myIntel i7-1165G7.


Very odd.

I have a bit of C code which uses nextafterf to step through every singleREAL*4 (or float in C) from -INFINITY to +INFINITY, and then computes twoexponential functions, one being a single precision routine which needs 2branches, 3 comparisons and just shy of 30 (super-scalar) multiplicationsand additions, and the other being a double/REAL*8 exp() routine from thesystem library, and compares them. It takes 46 seconds as a single threadon a Xeon E5-2650v4 which is 40% slower than your CPU (its technology is 4years older than yours).


Why your code takes 800 seconds for doing a lot less work is beyond me.
Maybe somebody else can shed light on it?  Sorry, I have no experience in
using such routines from Fortran at the level of intricate knowledge to
address your problem.  If I have any brilliant ideas, I will let you know.

Regards - Damian

Re: Very high cost of ieee_next_after function

Reply via email to