Guillaume Cottenceau wrote:
> 
> Karl Mitchell <[EMAIL PROTECTED]> writes:
> 
> [...]
> 
> > > better with -O1 than with -O2 or superior!??
> >
> > Faster with -O1 than -O2 or -O3 under almost all criteria for speed. I can't
> > remember the source though and the figures I've got (below) aren't a good
> > comparison. If you replace -O1 with -O3 on test B (below) you'll see what I mean.
> >
> > By the way, I would urge against using gcc -ffast-math, as it can cause actual
> > errors in floating point routines. I am a numerical modeller and I found for one of
> > my floating point iterative routines the cumulative error was a few percent in one
> > case. This could have been more to do with the version of egcs I was using, but I
> > doubt it. Admittedly this doesn't matter for all programs, but perhaps something
> > like 'octave' would suffer?
> 
> I don't know precisely why this has been chosen. I think this is because,
> this is default parameters, and packagers are supposed to override them
> when needed.
> 
> [...]
> 
> > A) CFLAGS = -s -static -Wall
> > B) CFLAGS = -s -static -Wall -O1
> > C) CFLAGS = -s -static -O3 -fomit-frame-pointer -Wall -mpentiumpro
> >          -march=pentiumpro -fforce-addr -fforce-mem -malign-loops=2
> >          -malign-functions=4 -malign-jumps=2 -funroll-loops
> >          -fexpensive-optimisations -malign-double -fschedule-insns2
> >          -mwide-multiply
> >
> > Based on K6-233 Index
> >
> >         Mem Index       Integer index   FP Index
> > A       1.349           1.014           1.824
> > B       3.605           2.284           5.003
> > C       3.609           3.245           7.175
> 
> I don't understand.
> 
> You did not explain the numbers:
> 
> I suppose than B is faster than A because there is no optim for A. So
> given this, I interpret your numbers are "speed index" (opposed to "time
> consumed"), e.g. the greater the faster.
> 
> So, For all three things, C is faster than B.
> 
> Why do you say -O1 is faster than -O2, then? Is it something to do with
> the other parameters? (so why did you put other parameters for C..!?)
> 
> --
> Guillaume Cottenceau -- Distribution Developer for MandrakeSoft
> http://www.mandrakesoft.com/~gc/

Probably he is referring to an article appeared on cpureview showing
that sometimes on K7 -O1 would produce better results than -O2 or more.
Those index seems the Byte index benchmark.

Definitively I think we should do better benchmark for K7, Duron
maybe using ssbench benchmark

        http://nastol.astro.lu.se/~stefans/bench.html 

Volunteers?

I remember that we've already choosen to avoid -ffast-math for mathematical
packages (gnuplot, octave, etc.). Maybe we should explicitely use it into
the spec files, sort of

CFLAGS="$RPM_OPT_FLAGS -fno-fast-math"?

Bye.
Giuseppe.

Reply via email to