On Thu, 25 Oct 2018, Michael Petlan wrote:

> The ctests/flops test fails on HPE Moonshot machines:

so what type of processor is it exactly in this machine?
Also which event is being used by FP_INS /FP_OPS?

I gather it might be Xgene, and in that case

PRESET,PAPI_FP_INS,NOT_DERIVED,INST_SPEC_EXEC_VFP

some of your problems are coming from the "SPEC_EXEC" (speculatively 
executed) part.  Not it being 3 counts for each instruction, but the fact 
that the value seems to vary.

Also if you look at the ARM ARM for a generic VFP_SPEC events it says
"The counter counts the last micro-operation of each data engine 
floating-point instruction."  So if somehow the fmadd instruction is 
broken up into 3 vfp micro-ops internally then you might get the 3-count.

> 3) Shouldn't the test adjust the estimation of expected
> result based on whether it tests FP_INS or FP_OPS, instead
> of based on whether __powerpc__ is defined?

well, we should properly break out if a fmadd counts as one or two ops on 
an architecture, which is what the powerpc test was really trying to do.  

as you are finding, floating point operation events vary a lot from vendor 
to vendor and even chip to chip and getting info about what is going on 
can be difficult.  AMD fam17h (ryzen) has mysterious issues with the FP 
event counts too and I'm still waiting to see if anyone from AMD can tell 
me why the events are behaving that way.

Vince


_______________________________________________
perfmon2-devel mailing list
perfmon2-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/perfmon2-devel

Reply via email to