Sun,

I think my choice of words contradict with the keywords used in the flags.

The main change here is enabling -OPT:roundoff=2 at -O3 instead of 
-OPT:roundoff=1. This change enables fast-math functions, aggressive loop nest 
optimizations, reassociation on floating point expressions and more aggressive 
round-off settings. These aggressive floating point optimizations improve 
performance when using the -O3 flag but may affect floating point accuracy. The 
use of -fp-accuracy=relaxed in addition to -O3 is recommended for cases which 
need more floating point accuracy. -fp-accuracy=relaxed automatically sets 
-OPT:roundoff=1. User may also use -OPT:roundoff=1.

Please let me know if you have questions.

Ram

-----Original Message-----
From: Sun Chan [mailto:sun.c...@gmail.com] 
Sent: Wednesday, April 27, 2011 5:02 PM
To: Ramanarayanan, Ramshankar
Cc: open64-devel@lists.sourceforge.net
Subject: Re: [Open64-devel] code review request for update to O3 flag

you are really saying that to get back previous behavior, one needs to
-fp_accuracy=not_relaxed or something to that effect? I don't follow
your message.
Sun

On Wed, Apr 27, 2011 at 6:37 PM, Ramanarayanan, Ramshankar
<ramshankar.ramanaraya...@amd.com> wrote:
> Could a gate keeper approve this patch?
>
>
>
> This update enhances performance of the compiled code on X8664 when using
> the O3 flag. Improvements come mainly from relaxing the floating point
> accuracy setting at O3. This enables a wide range of optimizations including
> loop nest optimizations and associative redundancy elimination
> optimizations. Given this change, users will need to use
> -fp-accuracy=relaxed flag in addition to -O3 if they require the earlier
> floating point precision. During subsequent tuning we found that the bad
> reference bias heuristic affects the computed cache costs and leads to
> incorrect choice of inner loops and is thus ignored.
>
>
>
> Following tests have been conducted with this change.
>
>
>
> 1: No compiler time failure for x86 build
>
> 2: SPEC CPU 2006 validated with AMD flags and with O3 flag
>
> 3: The gcc regression suite has no new failures on x86/Linux
>
>
>
> Best regards,
>
> Ram
>
>
>
> Ramshankar Ramanarayanan
>
> Member of Technical Staff
>
> Open Source Compiler Engineering
>
> Advanced Micro Devices, Bangalore
>
>
>
>
>
> ------------------------------------------------------------------------------
> WhatsUp Gold - Download Free Network Management Software
> The most intuitive, comprehensive, and cost-effective network
> management toolset available today.  Delivers lowest initial
> acquisition cost and overall TCO of any competing solution.
> http://p.sf.net/sfu/whatsupgold-sd
> _______________________________________________
> Open64-devel mailing list
> Open64-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/open64-devel
>
>



------------------------------------------------------------------------------
WhatsUp Gold - Download Free Network Management Software
The most intuitive, comprehensive, and cost-effective network 
management toolset available today.  Delivers lowest initial 
acquisition cost and overall TCO of any competing solution.
http://p.sf.net/sfu/whatsupgold-sd
_______________________________________________
Open64-devel mailing list
Open64-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/open64-devel

Reply via email to