Ken,

On Fri, Oct 23, 2009 at 1:41 PM, Kenneth Hoste <kenneth.ho...@ugent.be> wrote:
>
> The Intel documentation for Core i7 suggests that this is not the case
> on Nehalem, i.e. uops fused through micro-fusion are counted as 2 uops
> (while macro-fused uops are counted as 1), like you mentioned above.
>
> We missed this point somehow in your first reply, sorry about that.
>
> Thus, in order to obtain sensible numbers, it does indeed seem that
> we need to figure out the number of micro-fused uops in Core i7, and
> subtract that from the UOPS_RETIRED.ANY count we have now...
>
> It seems like there is no event for directly counting retied micro-fused
> uops
> however (while there is for UOPS_RETIRED.MACRO_FUSED).
>

> Are we missing something? Should we be able to get counts for retired
> micro-fused uops? If so, which event are you referring to? If not, any ideas
> on how we can obtain uops_retired counts on Core i7 where micro-fused
> uops are counted as 1?
>
It does not seem possible because you have 2 unknowns in the equation:

uops_retired.any = uops_retired.normal + uops_retired.micro +
2*uops_retired.micro.

You are missing micro and normal.
I thought you could may be do it by comparing what goes in with what comes out,
but that would only give you the wasted uops (on the wrong speculative path).

I will on that some more.
There are some information of interest in the Intel Optimization guide
(Appendix B2).

http://www.intel.com/Assets/PDF/manual/248966.pdf

------------------------------------------------------------------------------
Come build with us! The BlackBerry(R) Developer Conference in SF, CA
is the only developer event you need to attend this year. Jumpstart your
developing skills, take BlackBerry mobile applications to market and stay 
ahead of the curve. Join us from November 9 - 12, 2009. Register now!
http://p.sf.net/sfu/devconference
_______________________________________________
perfmon2-devel mailing list
perfmon2-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/perfmon2-devel

Reply via email to