> Also, that for_each_set_bit() loop, trying to find the events to > update... > > Can't we, instead, make the SLOTS update advance 5 running counters in > cpuc and feed the events off of that?
The original patches implemented this through a perf transaction and a cache. I think what you're suggesting is similar to the old cache. -Andi

