The following two patches optimise try_to_wake_up() when the wakee is descheduling. In a vanilla kernel, there can be excessive time spent spinning on p->on_rq. This is fine if it's a strictly synchronous wakeup and the waker is going to sleep but in other cases, the waker spins until it can do work that can be deferred to the wakee.
The first patch frontloads work that can be done before p->on_rq is checked. If it's a wakeup on a CPU that does not share cache then the wakelist is used instead of spinning. The second patch goes a little further and uses the wakelist if the wakee is descheduling and is the only task running on the target CPU. The performance impact is documented in the changelog of the second patch. kernel/sched/core.c | 81 ++++++++++++++++++++++++++++++++------------ kernel/sched/sched.h | 3 +- 2 files changed, 61 insertions(+), 23 deletions(-) -- 2.26.1

