Hi, This is more of a Nehalem question than perfmon one. Apologies if I am completely out of place.
I am trying to analyze some benchmarks on Intel Nehalem processor and see how much of their stall cycles are related to memory access. I looked at the documents: "Intel® 64 and IA-32 Architectures Optimization Reference Manual" and "Performance Analysis Guide for Intel® Core™ i7 Processor and Intel® Xeon™ 5500 processors". I could gather that: “Total Cycles“ = UOPS_EXECUTED.CORE_STALLS_CYCLES + UOPS_EXECUTED.CORE_ACTIVE_CYCLES where Total Cycles is CPU_CLK_UNHALTED.THREAD. I also understand that memory related accesses are through ports 2,3 and 4; where as, ALU related operations are through ports 0, 1 and 5. I could find UOPS_EXECUTED.PORT015_STALL_CYCLES counter to get ALU related stalls but no counter to get memory related stalls. Counter UOPS_EXECUTED.PORT234_CORE seems to be overall memory UOPS and not stall cycles. Could anyone suggest how to identify memory related stalls? Also, for the programs I ran, UOPS_EXECUTED.PORT015_STALL_CYCLES was greater than UOPS_EXECUTED.CORE_STALLS_CYCLES. Does that make sense? Thanks, Vineeth ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ perfmon2-devel mailing list perfmon2-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/perfmon2-devel