Hi Ram, I was going to use -fp-accuracy option. I found it is very counter-initutive when "-fp-accuracy=relaxed" means less fp freedom but more fp accuracy. "relaxed" does not sound like improving accuracy at all.
Can we change the setting of -fp-accuracy to be more easily understandable, like "strict|strict-fma|limited|relaxed"? Also, the default -O3 setting does not map to any of -fp-accuracy value, which seems strange to me. Thanks, Min On Wed, Apr 27, 2011 at 5:47 AM, Ramanarayanan, Ramshankar <ramshankar.ramanaraya...@amd.com> wrote: > Sun, > > I think my choice of words contradict with the keywords used in the flags. > > The main change here is enabling -OPT:roundoff=2 at -O3 instead of > -OPT:roundoff=1. This change enables fast-math functions, aggressive loop > nest optimizations, reassociation on floating point expressions and more > aggressive round-off settings. These aggressive floating point optimizations > improve performance when using the -O3 flag but may affect floating point > accuracy. The use of -fp-accuracy=relaxed in addition to -O3 is recommended > for cases which need more floating point accuracy. -fp-accuracy=relaxed > automatically sets -OPT:roundoff=1. User may also use -OPT:roundoff=1. > > Please let me know if you have questions. > > Ram > > -----Original Message----- > From: Sun Chan [mailto:sun.c...@gmail.com] > Sent: Wednesday, April 27, 2011 5:02 PM > To: Ramanarayanan, Ramshankar > Cc: open64-devel@lists.sourceforge.net > Subject: Re: [Open64-devel] code review request for update to O3 flag > > you are really saying that to get back previous behavior, one needs to > -fp_accuracy=not_relaxed or something to that effect? I don't follow > your message. > Sun > > On Wed, Apr 27, 2011 at 6:37 PM, Ramanarayanan, Ramshankar > <ramshankar.ramanaraya...@amd.com> wrote: >> Could a gate keeper approve this patch? >> >> >> >> This update enhances performance of the compiled code on X8664 when using >> the O3 flag. Improvements come mainly from relaxing the floating point >> accuracy setting at O3. This enables a wide range of optimizations including >> loop nest optimizations and associative redundancy elimination >> optimizations. Given this change, users will need to use >> -fp-accuracy=relaxed flag in addition to -O3 if they require the earlier >> floating point precision. During subsequent tuning we found that the bad >> reference bias heuristic affects the computed cache costs and leads to >> incorrect choice of inner loops and is thus ignored. >> >> >> >> Following tests have been conducted with this change. >> >> >> >> 1: No compiler time failure for x86 build >> >> 2: SPEC CPU 2006 validated with AMD flags and with O3 flag >> >> 3: The gcc regression suite has no new failures on x86/Linux >> >> >> >> Best regards, >> >> Ram >> >> >> >> Ramshankar Ramanarayanan >> >> Member of Technical Staff >> >> Open Source Compiler Engineering >> >> Advanced Micro Devices, Bangalore >> >> >> >> >> >> ------------------------------------------------------------------------------ >> WhatsUp Gold - Download Free Network Management Software >> The most intuitive, comprehensive, and cost-effective network >> management toolset available today. Delivers lowest initial >> acquisition cost and overall TCO of any competing solution. >> http://p.sf.net/sfu/whatsupgold-sd >> _______________________________________________ >> Open64-devel mailing list >> Open64-devel@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/open64-devel >> >> > > > > ------------------------------------------------------------------------------ > WhatsUp Gold - Download Free Network Management Software > The most intuitive, comprehensive, and cost-effective network > management toolset available today. Delivers lowest initial > acquisition cost and overall TCO of any competing solution. > http://p.sf.net/sfu/whatsupgold-sd > _______________________________________________ > Open64-devel mailing list > Open64-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/open64-devel > ------------------------------------------------------------------------------ FREE DOWNLOAD - uberSVN with Social Coding for Subversion. Subversion made easy with a complete admin console. Easy to use, easy to manage, easy to install, easy to extend. Get a Free download of the new open ALM Subversion platform now. http://p.sf.net/sfu/wandisco-dev2dev _______________________________________________ Open64-devel mailing list Open64-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/open64-devel