Re: [racket-users] IEEE 754 single precision float support

George Neuner Tue, 10 Apr 2018 01:49:19 -0700


On 4/10/2018 1:36 AM, [email protected] wrote:

For the applications I work on, double precision floats are too costlyto use; although the CPU cycle count to operate on doubles tend to bethe same as single precision floats on modern hardware, the bandwidthcost is too prohibitive. We really do need single precision floats,and in many cases, 16 bit half precision floats due to the bandwidthsavings.

Then you probably want SIMD vector ops too, which, AFAIK, are not yetsupported. FP math in Racket does use the SIMD unit on most targets,but normal math computes one value at a time, using only one slot perSIMD register, as opposed to the N slots available at the given precision.[This is the same as in C: if you want vector ops, you use SIMDintrinsics instead of the normal C operators.]

In Racket, there are tricks you can play with typed arrays and/or unsafeoperations to get more speed from bypassing the language's typesafeguards ... but you won't get vector ops AFAIK unless you drop into Ccode.

And again, there is no half precision available. Half precision isavailable only in GPUs or certain DSPs - no CPU implements it.

With regard to exactness, I don't need exactness to compare two singleprecision floats. I would like to have exactness in the ground truththat I compute to be able to calculate the error in the singleprecision float version of the computation. The idea is that Iimplement two versions of an algorithm. One uses the exact numberssupported by Racket and the other would use single precision floats,then I would like to compute error with (flulp-error x r) or somethingsimilar.

How long do you want to wait for "truth" calculations. Done usingeither rationals (software bigint / bigint fractions), or bigfloats(software adjustable width FP) with results converted to rational forcomparison, the truth calculation is going to be many orders ofmagnitude slower than hardware FP math.

Do you have enough memory? Rationals can expand to fill all availablespace.

Is there a better approach to do this kind of analysis?

You really haven't specified any "analysis" per se. Thus far you havesaid only that you want to execute two versions of the same algorithm:one using exact (or maybe high precision float) values, and one usinglow (single) precision values, and compare the results.

What you proposed is fine as far as it goes, but I question whethermeasuring ulps error really is what you want to do. That more typicallywould be done to compare answers computed to the same precision usingdifferent algorithms. In your case, the low precision value will likelylead to large errors vs the exact one - think about how intermediatevalues overflowing or underflowing might affect the end result.

Perhaps some kind of relative error measurement would be moreappropriate? Without knowing the algorithm in question, nobody canreally give better suggestions.

-Dale Kim


YMMV,
George

--
You received this message because you are subscribed to the Google Groups "Racket 
Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Re: [racket-users] IEEE 754 single precision float support

Reply via email to