Hi Stephane,
I tried the pebs-ll sampling module of pfmon for the Intel Nehalem and the results I get are a bit unexpected. I sampled some programs using the following command: pfmon --smpl-module=pebs-ll -e MEM_INST_RETIRED:LATENCY_ABOVE_THRESHOLD --ld-lat-threshold=64 --long-smpl-periods=500 --short-smpl-periods=250 --pebs-ll-dcmiss-reasons --with-header <my_program> I thought that by setting the threshold to 64, mostly L3 misses will be sampled (reason 0xA to 0xD). However, only around 10% of the samples I obtained list L3 misses as the source of the sampled instruction. The remaining 90% of the samples is mostly caused by reasons 0x1 to 0x4 (L1 hits, pending L2 hits, L2 hits and L3 hits). These events should normally have a latency less than 64 cycles so they should have been filtered by the threshold value I initially set. If I set the latency to 128 and decrease the sampling interval, the percentage of L3 misses increases to around 30%, but on-chip cache hits still don't disappear. Do you have any idea why this anomaly occurs? Many thanks. Best regards, Zoltan ------------------------------------------------------------------------------ The Palm PDK Hot Apps Program offers developers who use the Plug-In Development Kit to bring their C/C++ apps to Palm for a share of $1 Million in cash or HP Products. Visit us here for more details: http://ad.doubleclick.net/clk;226879339;13503038;l? http://clk.atdmt.com/CRS/go/247765532/direct/01/ _______________________________________________ perfmon2-devel mailing list perfmon2-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/perfmon2-devel