Hi, On 2 August 2016 at 09:48, David Carrillo-Cisneros <[email protected]> wrote: > There is an optimization in perf_cgroup_sched_{in,out} that skips the > switch of cgroup events if the old and new cgroups in a task switch are > the same. This optimization interacts with the current code in two ways > that cause a cpu context's cgroup (cpuctx->cgrp) to be NULL even if a > cgroup event matches the current task. These are: > > 1. On creation of the first cgroup event in a CPU: In current code, > cpuctx->cpu is only set in perf_cgroup_sched_in, but due to the > aforesaid optimization, perf_cgroup_sched_in will run until the next > cgroup switches in that cpu. This may happen late or never happen, > depending on system's number of cgroups, cpu load, etc. > > 2. On deletion of the last cgroup event in a cpuctx: In list_del_event, > cpuctx->cgrp is set NULL. Any new cgroup event will not be sched in > because cpuctx->cgrp == NULL until a cgroup switch occurs and > perf_cgroup_sched_in is executed (updating cpuctx->cgrp). > > This patch fixes both problems by setting cpuctx->cgrp in list_add_event, > mirroring what list_del_event does when removing a cgroup event from CPU > context, as introduced in: > commit 68cacd29167b ("perf_events: Fix stale ->cgrp pointer in > update_cgrp_time_from_cpuctx()") > > With this patch, cpuctx->cgrp is always set/clear when installing/removing > the first/last cgroup event in/from the cpu context. With cpuctx->cgrp > correctly set, event_filter_match works as intended when events are > sched in/out. > > The problem is easy to observe in a machine with only one cgroup: > > $ perf stat -e cycles -I 1000 -C 0 -G / > # time counts unit events > 1.000161699 <not counted> cycles / > 2.000355591 <not counted> cycles / > 3.000565154 <not counted> cycles / > 4.000951350 <not counted> cycles / > > After the fix, the output is as expected: > > $ perf stat -e cycles -I 1000 -a -G / > # time counts unit events > 1.004699159 627342882 cycles / > 2.007397156 615272690 cycles / > 3.010019057 616726074 cycles / > > Rebased at peterz/queue/perf/core. > > Changes in v2: > - Fix build error when no CONFIG_CGROUP_PERF. > - Unify add and del cases into list_update_cgroup_event. > - Remove cgroup exclusive variables from builds > without CONFIG_CGROUP_PERF. > > Signed-off-by: David Carrillo-Cisneros <[email protected]>
Is this supposed to fix the bug I reported here? https://www.mail-archive.com/[email protected]/msg1197757.html If so, you may want to add: 1) Fixes: f2fb6bef92514 ("perf/core: Optimize side-band event delivery") 2) Reported-by: Vegard Nossum <[email protected]> 3) a link to the thread and I can give it a test to see if it fixes the problem I was running into. If not, please ignore :-) Thanks, Vegard

