Hi:
Paul Leyland wrote:
> Actually, we at Microsoft Research in Cambridge have seen similar effects
> when compiling and running FFTW code. Our discovery is that the alignment
> of FP data values is critical. Get it wrong, and performance can plummet.
> Unless you set the alignment explicitly, it will be wrong approximately half
> the time.
Your right, I gained a 35% performance only with doing a simple trick to
be sure there were a 8-bytes alignement. On the other hand, I made the
FFTW library using long double float type (with a 'awful' 10-bytes long)
and the performance was near 65% in comparison with double float type
performance.
| Guillermo Ballester Valor |
| [EMAIL PROTECTED] |
| c/ cordoba, 19 |
| 18151-Ogijares (Spain) |
| (Linux registered user 1171811) |
_________________________________________________________________
Unsubscribe & list info -- http://www.scruz.net/~luke/signup.htm
Mersenne Prime FAQ -- http://www.tasam.com/~lrwiman/FAQ-mers