Re: [perfmon2] wrong count given by task_cpu example for libpfm4

2013-09-09 Thread Bhavishya Goel
Yes, that's what I am doing. Below is an example of the counts I get from task vs task_cpu from a 4-threaded binary: $> task -i -e INSTRUCTIONS_RETIRED ./binary 27 708 362 452 INSTRUCTIONS_RETIRED $> task_cpu -i -e INSTRUCTIONS_RETIRED ./binary CPU0 29 529 250 689 INSTRUCTIONS_RETIRED CPU1 26 9

Re: [perfmon2] wrong count given by task_cpu example for libpfm4

2013-09-09 Thread Bhavishya Goel
Thanks Stephane. That works for me. On Mon, Sep 9, 2013 at 6:58 PM, Stephane Eranian wrote: > Hi, > > I get the same thing with my own little multithreaded benchmark. > > I think it has to do with what happens in per-thread/per-cpu mode > with inheritance. I think each events of the child thread

Re: [perfmon2] wrong count given by task_cpu example for libpfm4

2013-09-09 Thread Stephane Eranian
On Mon, Sep 9, 2013 at 4:35 PM, Bhavishya Goel wrote: > It is multi-threaded. > > Then you need to add up all the counts from all the CPUs. > On Mon, Sep 9, 2013 at 4:26 PM, Stephane Eranian > wrote: >> >> On Tue, Sep 3, 2013 at 8:51 AM, Bhavishya Goel >> wrote: >> > Hi, >> > >> > I am using "t

Re: [perfmon2] wrong count given by task_cpu example for libpfm4

2013-09-09 Thread Stephane Eranian
Hi, I get the same thing with my own little multithreaded benchmark. I think it has to do with what happens in per-thread/per-cpu mode with inheritance. I think each events of the child threads are propagated to each parent. I get 12x the amount on my 12 CPUs workstation. If you drop -i in task_c

Re: [perfmon2] wrong count given by task_cpu example for libpfm4

2013-09-09 Thread Bhavishya Goel
It is multi-threaded. On Mon, Sep 9, 2013 at 4:26 PM, Stephane Eranian wrote: > On Tue, Sep 3, 2013 at 8:51 AM, Bhavishya Goel > wrote: > > Hi, > > > > I am using "task_cpu" example in perf_examples folder of libpfm4.4 (linux > > kernel 3.11-rc7, micrarchitecture: haswell). The counter numbers

Re: [perfmon2] wrong count given by task_cpu example for libpfm4

2013-09-09 Thread Stephane Eranian
On Tue, Sep 3, 2013 at 8:51 AM, Bhavishya Goel wrote: > Hi, > > I am using "task_cpu" example in perf_examples folder of libpfm4.4 (linux > kernel 3.11-rc7, micrarchitecture: haswell). The counter numbers that I get > from task_cpu seem to be wrong as they are vastly different than what I get > fr

[perfmon2] wrong count given by task_cpu example for libpfm4

2013-09-02 Thread Bhavishya Goel
Hi, I am using "task_cpu" example in perf_examples folder of libpfm4.4 (linux kernel 3.11-rc7, micrarchitecture: haswell). The counter numbers that I get from task_cpu seem to be wrong as they are vastly different than what I get from "task" example and Intel's PCM-TSX tool. This is an example of