> $ ./task -e LLC_MISSES,LLC_REFERENCES ./mcol 
> Allocating 16 MiB for 1024x1024 matrix
> 1677.72MiB/s
> 
>            2,126,459 LLC_MISSES (4,739,259,550 : 4,739,259,550)
>          131,373,392 LLC_REFERENCES (4,739,259,550 : 4,739,259,550)

What compiler options did you use?

On my Core 2 machine, when compiled with "-O2 -Wall", I get:
Allocating 16 MiB for 1024x1024 matrix
infMiB/s

              16,314 LLC_MISSES (0.00% scaling, ena=32,442,281, run=32,442,281)
              60,686 LLC_REFERENCES (0.00% scaling, ena=32,442,281, run=32,442,281)

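This is because your code never uses the matrix values, so with 
optimizations enabled the compiler is free to remove the whole traversal 
as dead code (hence the infMiB/s: almost no time elapses).  If you're 
compiling without optimizations, the loops survive, but you're likely to 
see inefficient memory access patterns from the unoptimized code.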
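
One way to keep the traversal alive under -O2 is to make its result 
observable. The following is only a sketch, not the actual mcol source: 
the function names, the uint32_t element type, the runtime "seed" fill 
value, and the volatile sink are all assumptions made for illustration.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical sink: storing to a volatile object is a side effect the
 * compiler has to preserve, so the value written must be computed. */
volatile uint64_t sink;

/* Walk an n x n row-major matrix column by column and return the sum of
 * all elements, so that every load contributes to a visible result. */
uint64_t sum_columns(const uint32_t *m, size_t n)
{
    uint64_t sum = 0;
    for (size_t col = 0; col < n; col++)
        for (size_t row = 0; row < n; row++)
            sum += m[row * n + col];
    return sum;
}

/* Allocate an n x n matrix, fill it with a value supplied at run time,
 * traverse it, and publish the sum. Returns 0 if allocation fails. */
uint64_t run_traversal(size_t n, uint32_t seed)
{
    uint32_t *m = malloc(n * n * sizeof *m);
    if (m == NULL)
        return 0;
    for (size_t i = 0; i < n * n; i++)
        m[i] = seed;
    uint64_t sum = sum_columns(m, n);
    sink = sum;   /* the result is now used, so the loops are not dead */
    free(m);
    return sum;
}
```

Calling run_traversal(1024, seed) from main with a seed taken from argv, 
and printing the return value, should make it much harder for the 
compiler to drop the loops, although it may still reorder or vectorize 
them.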

Vince
_______________________________________________
perfmon2-devel mailing list
perfmon2-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/perfmon2-devel