The accuracy is controlled by the roundoff setting (-OPT:roundoff) etc. -O2, -O3, etc. are tied to roundoff and to the IEEE internal flags. -fp-accuracy is just a gcc option.

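To make the accuracy point concrete, here is a minimal sketch of why reassociation on floating-point expressions (one of the optimizations the quoted message below attributes to -OPT:roundoff=2) can change a result. It is plain Python, used only because its floats are IEEE doubles. It illustrates the arithmetic, not what Open64 actually emits, and the function names are made up for the example.

```python
# Illustration only: floating-point addition is not associative, so a
# compiler that regroups (a + b) + c as a + (b + c) can change the answer.
# The function names are hypothetical, not part of Open64.

def sum_as_written(a, b, c):
    """Evaluate strictly left to right, as the source expresses it."""
    return (a + b) + c


def sum_reassociated(a, b, c):
    """Evaluate with the grouping a reassociating optimizer might pick."""
    return a + (b + c)


if __name__ == "__main__":
    a, b, c = 1e20, -1e20, 1.0
    print(sum_as_written(a, b, c))    # 1.0: a and b cancel first, then c is added
    print(sum_reassociated(a, b, c))  # 0.0: c is lost when added to b
```

Both groupings are mathematically equal, so whether the compiler may pick the second one is exactly the kind of freedom a roundoff level decides.
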
On Sat, Aug 13, 2011 at 6:07 AM, Min Zhao <mzhao...@gmail.com> wrote:
> Hi Ram,
>
> I was going to use the -fp-accuracy option. I found it very
> counter-intuitive that "-fp-accuracy=relaxed" means less fp freedom
> but more fp accuracy. "relaxed" does not sound like improving accuracy
> at all.
>
> Can we change the settings of -fp-accuracy to be more easily
> understandable, like "strict|strict-fma|limited|relaxed"?
>
> Also, the default -O3 setting does not map to any -fp-accuracy
> value, which seems strange to me.
>
> Thanks,
>
> Min
>
>
> On Wed, Apr 27, 2011 at 5:47 AM, Ramanarayanan, Ramshankar
> <ramshankar.ramanaraya...@amd.com> wrote:
>> Sun,
>>
>> I think my choice of words contradicts the keywords used in the flags.
>>
>> The main change here is enabling -OPT:roundoff=2 at -O3 instead of
>> -OPT:roundoff=1. This change enables fast-math functions, aggressive loop
>> nest optimizations, reassociation on floating point expressions and more
>> aggressive round-off settings. These aggressive floating point optimizations
>> improve performance when using the -O3 flag but may affect floating point
>> accuracy. The use of -fp-accuracy=relaxed in addition to -O3 is recommended
>> for cases which need more floating point accuracy. -fp-accuracy=relaxed
>> automatically sets -OPT:roundoff=1. Users may also use -OPT:roundoff=1
>> directly.
>>
>> Please let me know if you have questions.
>>
>> Ram
>>
>> -----Original Message-----
>> From: Sun Chan [mailto:sun.c...@gmail.com]
>> Sent: Wednesday, April 27, 2011 5:02 PM
>> To: Ramanarayanan, Ramshankar
>> Cc: open64-devel@lists.sourceforge.net
>> Subject: Re: [Open64-devel] code review request for update to O3 flag
>>
>> you are really saying that to get back previous behavior, one needs to
>> -fp_accuracy=not_relaxed or something to that effect? I don't follow
>> your message.
>> Sun
>>
>> On Wed, Apr 27, 2011 at 6:37 PM, Ramanarayanan, Ramshankar
>> <ramshankar.ramanaraya...@amd.com> wrote:
>>> Could a gate keeper approve this patch?
>>>
>>> This update enhances performance of the compiled code on X8664 when using
>>> the O3 flag. Improvements come mainly from relaxing the floating point
>>> accuracy setting at O3. This enables a wide range of optimizations including
>>> loop nest optimizations and associative redundancy elimination
>>> optimizations. Given this change, users will need to use
>>> -fp-accuracy=relaxed flag in addition to -O3 if they require the earlier
>>> floating point precision. During subsequent tuning we found that the bad
>>> reference bias heuristic affects the computed cache costs and leads to
>>> incorrect choice of inner loops and is thus ignored.
>>>
>>> Following tests have been conducted with this change.
>>>
>>> 1: No compiler time failure for x86 build
>>> 2: SPEC CPU 2006 validated with AMD flags and with O3 flag
>>> 3: The gcc regression suite has no new failures on x86/Linux
>>>
>>> Best regards,
>>>
>>> Ram
>>>
>>> Ramshankar Ramanarayanan
>>> Member of Technical Staff
>>> Open Source Compiler Engineering
>>> Advanced Micro Devices, Bangalore
_______________________________________________
Open64-devel mailing list
Open64-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/open64-devel