Hi,

This is more of a Nehalem question than perfmon one. Apologies if I am 
completely out of place.

I am trying to analyze some benchmarks on Intel Nehalem processor and 
see how much of their stall cycles are related to memory access. I 
looked at the documents: "Intel® 64 and IA-32 Architectures Optimization 
Reference Manual" and "Performance Analysis Guide for Intel® Core™ i7 
Processor and Intel® Xeon™ 5500 processors".

I could gather that: “Total Cycles“ = UOPS_EXECUTED.CORE_STALLS_CYCLES + 
UOPS_EXECUTED.CORE_ACTIVE_CYCLES where Total Cycles is 
CPU_CLK_UNHALTED.THREAD. I also understand that memory related accesses 
are through ports 2,3 and 4; where as, ALU related operations are 
through ports 0, 1 and 5.

I could find UOPS_EXECUTED.PORT015_STALL_CYCLES counter to get ALU 
related stalls but no counter to get memory related stalls. Counter 
UOPS_EXECUTED.PORT234_CORE seems to be overall memory UOPS and not stall 
cycles. Could anyone suggest how to identify memory related stalls?

Also, for the programs I ran, UOPS_EXECUTED.PORT015_STALL_CYCLES was 
greater than UOPS_EXECUTED.CORE_STALLS_CYCLES. Does that make sense?

Thanks,
Vineeth

------------------------------------------------------------------------------
This SF.net email is sponsored by Sprint
What will you do first with EVO, the first 4G phone?
Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first
_______________________________________________
perfmon2-devel mailing list
perfmon2-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/perfmon2-devel

Reply via email to