On Apr 27, 2009, at 16:43 , stephane eranian wrote: > Hi, > > > On Wed, Apr 22, 2009 at 7:40 AM, Nagy Mostafa > <nagy.most...@gmail.com> wrote: >> Hi, >> What is the most accurate way to measure trace cache miss rate on >> Xeon (dual >> core) using pfmon? I looked into TC_DELIVER_MODE but it counts the >> cycles >> for each operation mode (Deliver or Build), and it also include >> cycle counts >> for both processors (I am only interested in the one running my >> code). >> > > I am afraid that's all there is at your disposal. Note that > TC_DELIVER has > unit-masks to filter which CPU (I assume which hyperthread) you > measure. > If you guarantee the other thread is not running, that should be > enough, shouldn't it? > Otherwise you can disable HT.
I believe there is another option. I count trace cache misses using the BPU_fetch_request event, which has an event mask bit for counting trace cache lookup misses. I'm not 100% sure that this will count each and every trace cache miss, because it depends whether or not the "Branch Predictor unit" always comes into play when determining the next sequence of instructions to bring into the pipeline. I'm not sure if it comes into play when it's just straight-line code (i.e. no jumps). I'm not using perfmon for this, I program the counters myself using the perfex tool that comes with the perfctr patch, so I'm not sure if perfmon support this event. K. > > > >> thanks, >> - nagy >> >> My cpu info: >> model name : Intel(R) Xeon(TM) CPU 2.40GHz >> flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge >> mca cmov >> pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe pebs bts >> >> Events available: >> $ pfmon -l >> TC_DELIVER_MODE >> BPU_FETCH_REQUEST >> ITLB_REFERENCE >> MEMORY_CANCEL >> MEMORY_COMPLETE >> LOAD_PORT_REPLAY >> STORE_PORT_REPLAY >> MOB_LOAD_REPLAY >> PAGE_WALK_TYPE >> BSQ_CACHE_REFERENCE >> IOQ_ALLOCATION >> IOQ_ACTIVE_ENTRIES >> FSB_DATA_ACTIVITY >> BSQ_ALLOCATION >> BSQ_ACTIVE_ENTRIES >> SSE_INPUT_ASSIST >> PACKED_SP_UOP >> PACKED_DP_UOP >> SCALAR_SP_UOP >> SCALAR_DP_UOP >> 64BIT_MMX_UOP >> 128BIT_MMX_UOP >> X87_FP_UOP >> TC_MISC >> GLOBAL_POWER_EVENTS >> TC_MS_XFER >> UOP_QUEUE_WRITES >> RETIRED_MISPRED_BRANCH_TYPE >> RETIRED_BRANCH_TYPE >> RESOURCE_STALL >> WC_BUFFER >> B2B_CYCLES >> BNR >> SNOOP >> RESPONSE >> FRONT_END_EVENT >> EXECUTION_EVENT >> REPLAY_EVENT >> INSTR_RETIRED >> UOPS_RETIRED >> UOPS_TYPE >> BRANCH_RETIRED >> MISPRED_BRANCH_RETIRED >> X87_ASSIST >> MACHINE_CLEAR >> INSTR_COMPLETED >> >> >> ------------------------------------------------------------------------------ >> Stay on top of everything new and different, both inside and >> around Java (TM) technology - register by April 22, and save >> $200 on the JavaOne (SM) conference, June 2-5, 2009, San Francisco. >> 300 plus technical and hands-on sessions. Register today. >> Use priority code J9JMT32. http://p.sf.net/sfu/p >> _______________________________________________ >> perfmon2-devel mailing list >> perfmon2-devel@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/perfmon2-devel >> >> > > ------------------------------------------------------------------------------ > Crystal Reports - New Free Runtime and 30 Day Trial > Check out the new simplified licensign option that enables unlimited > royalty-free distribution of the report engine for externally > facing > server and web deployment. > http://p.sf.net/sfu/businessobjects > _______________________________________________ > perfmon2-devel mailing list > perfmon2-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/perfmon2-devel -- Kenneth Hoste Paris research group - ELIS - Ghent University, Belgium email: kenneth.ho...@elis.ugent.be website: http://www.elis.ugent.be/~kehoste blog: http://boegel.kejo.be ------------------------------------------------------------------------------ Register Now & Save for Velocity, the Web Performance & Operations Conference from O'Reilly Media. Velocity features a full day of expert-led, hands-on workshops and two days of sessions from industry leaders in dedicated Performance & Operations tracks. Use code vel09scf and Save an extra 15% before 5/3. http://p.sf.net/sfu/velocityconf _______________________________________________ perfmon2-devel mailing list perfmon2-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/perfmon2-devel