Marco Atzeri wrote:
On 2/27/2015 5:49 PM, Bengt Larsson wrote:
Below are two benchmarks that explore maximum floating point
performance. loopm6 is double precision floating point and loopm6fp is
parallell single-precision. They are manually unrolled multiply-add
loops.
I used to reach 2.8
On 2/27/2015 5:49 PM, Bengt Larsson wrote:
Below are two benchmarks that explore maximum floating point
performance. loopm6 is double precision floating point and loopm6fp is
parallell single-precision. They are manually unrolled multiply-add
loops.
I used to reach 2.8 and 11 GFlops on these.
Below are two benchmarks that explore maximum floating point
performance. loopm6 is double precision floating point and loopm6fp is
parallell single-precision. They are manually unrolled multiply-add
loops.
I used to reach 2.8 and 11 GFlops on these. Now I only get
2 and 6.
If you explore the
3 matches
Mail list logo