Hi,
My question concerns a platform equipped with two Intel Xeon X5650 processors.
According to the perf wiki page
(https://perf.wiki.kernel.org/index.php/Tutorial), "by default perf stat
counts for all threads of the process and subsequent child processes and
threads" and "By default, perf stat counts in per-thread mode".
So my first question is: what is the default, per-thread or per-process?
Then, independently of the answer, I am wondering how perf handles
per-thread or per-process counting with respect to the scheduler and
migrations. I did not find it stated explicitly in the Intel
documentation, but it seems natural that the hardware performance
counters located on a given core can only count events on that core and
not on other cores. Is that true?
Moreover, the wiki page says that "When a thread migrated from one
processor to another, counters are saved on the current processor and
are restored on the new one" (this seems to confirm the answer to my
previous question above). This implies either that the scheduler is
aware of perf or that perf is able to register a hook into the
scheduler. So I guess this is done in the kernel part of perf (in the
implementation of the perf_event_open system call) and not in the
userland part. Is that correct?
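
To make the question concrete, here is a minimal sketch in C of the
per-thread setup I have in mind. It is written from the
perf_event_open(2) man page, not from the perf tool's source. Per that
man page, pid = 0 with cpu = -1 measures the calling thread on whichever
CPU it runs on, and the inherit bit extends counting to children created
afterwards. The helper names make_instructions_attr and
count_instructions are my own (hypothetical), and the instructions event
is chosen only for illustration.

```c
#define _GNU_SOURCE
#include <assert.h>
#include <linux/perf_event.h>
#include <stdint.h>
#include <string.h>
#include <sys/ioctl.h>
#include <sys/syscall.h>
#include <sys/types.h>
#include <unistd.h>

/* glibc has no wrapper for perf_event_open, so go through syscall(2). */
static long perf_event_open(struct perf_event_attr *attr, pid_t pid, int cpu,
                            int group_fd, unsigned long flags)
{
    return syscall(SYS_perf_event_open, attr, pid, cpu, group_fd, flags);
}

/* Hypothetical helper: describe a user-space "instructions retired" counter.
 * inherit != 0 asks the kernel to also count children created later. */
struct perf_event_attr make_instructions_attr(int inherit)
{
    struct perf_event_attr attr;

    memset(&attr, 0, sizeof(attr));
    attr.type = PERF_TYPE_HARDWARE;
    attr.size = sizeof(attr);
    attr.config = PERF_COUNT_HW_INSTRUCTIONS;
    attr.disabled = 1;        /* start stopped, enable explicitly below */
    attr.exclude_kernel = 1;  /* count user-space instructions only */
    attr.exclude_hv = 1;
    attr.inherit = inherit ? 1 : 0;
    return attr;
}

/* Hypothetical helper: count instructions executed by work() on the calling
 * thread. Returns 0 on success, -1 if the counter could not be opened/read. */
int count_instructions(void (*work)(void), uint64_t *count)
{
    struct perf_event_attr attr = make_instructions_attr(0);
    ssize_t n;
    int fd;

    assert(work != NULL && count != NULL);

    /* pid = 0: the calling thread; cpu = -1: any CPU it is scheduled on. */
    fd = (int)perf_event_open(&attr, 0, -1, -1, 0);
    if (fd < 0)
        return -1;

    ioctl(fd, PERF_EVENT_IOC_RESET, 0);
    ioctl(fd, PERF_EVENT_IOC_ENABLE, 0);
    work();
    ioctl(fd, PERF_EVENT_IOC_DISABLE, 0);

    /* With the default read_format, read() returns one 64-bit count. */
    n = read(fd, count, sizeof(*count));
    close(fd);
    return n == (ssize_t)sizeof(*count) ? 0 : -1;
}
```

Nothing in this user-space code deals with migration: the file
descriptor is bound to the thread, not to a core. That is why I assume
the save/restore has to happen on the kernel side. Note that the open
may fail (for example with EACCES) depending on
/proc/sys/kernel/perf_event_paranoid and on whether the hardware event
is available.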
Thanks
--
Manu