On 2025-07-25 13:44, Nysal Jan K.A. wrote:
Add a lock contention tracepoint in the queued spinlock slowpath.
Also add the __lockfunc annotation so that in_lock_functions()
works as expected.
Signed-off-by: Nysal Jan K.A. <ny...@linux.ibm.com>
---
arch/powerpc/lib/qspinlock.c | 13 ++++++++-----
1 file changed, 8 insertions(+), 5 deletions(-)
diff --git a/arch/powerpc/lib/qspinlock.c b/arch/powerpc/lib/qspinlock.c
index bcc7e4dff8c3..622e7f45c2ce 100644
--- a/arch/powerpc/lib/qspinlock.c
+++ b/arch/powerpc/lib/qspinlock.c
@@ -9,6 +9,7 @@
#include <linux/sched/clock.h>
#include <asm/qspinlock.h>
#include <asm/paravirt.h>
+#include <trace/events/lock.h>
#define MAX_NODES 4
@@ -708,8 +709,9 @@ static __always_inline void queued_spin_lock_mcs_queue(struct qspinlock *lock, b
qnodesp->count--;
}
-void queued_spin_lock_slowpath(struct qspinlock *lock)
+void __lockfunc queued_spin_lock_slowpath(struct qspinlock *lock)
{
+ trace_contention_begin(lock, LCB_F_SPIN);
/*
* This looks funny, but it induces the compiler to inline both
* sides of the branch rather than share code as when the condition
@@ -718,16 +720,17 @@ void queued_spin_lock_slowpath(struct qspinlock *lock)
if (IS_ENABLED(CONFIG_PARAVIRT_SPINLOCKS) && is_shared_processor()) {
if (try_to_steal_lock(lock, true)) {
spec_barrier();
- return;
+ } else {
+ queued_spin_lock_mcs_queue(lock, true);
}
- queued_spin_lock_mcs_queue(lock, true);
} else {
if (try_to_steal_lock(lock, false)) {
spec_barrier();
- return;
+ } else {
+ queued_spin_lock_mcs_queue(lock, false);
}
- queued_spin_lock_mcs_queue(lock, false);
}
+ trace_contention_end(lock, 0);
}
EXPORT_SYMBOL(queued_spin_lock_slowpath);
Hello,
I have verified the patch with the latest upstream Linux kernel, and
here are my findings:
———Kernel Version———
6.16.0-rc7-160000.11-default+
———perf --version———
perf version 6.16.rc7.g5f33ebd2018c
To test this patch, I used the Lockstorm benchmark, which rigorously
exercises spinlocks from kernel space.
Benchmark repository: https://github.com/lop-devops/lockstorm
To capture all events related to the Lockstorm benchmark, I used the
following command:
cmd: perf lock record -a insmod lockstorm.ko
After generating the perf.data, I analyzed the results using:
cmd: perf lock contention -a -i perf.data
————Logs————
contended   total wait   max wait    avg wait    type       caller
  6187241      12.50 m    2.30 ms   121.22 us    spinlock   kthread+0x160
       78      8.23 ms  209.87 us   105.47 us    rwlock:W   do_exit+0x378
       71      7.97 ms  208.07 us   112.24 us    spinlock   do_exit+0x378
       68      4.18 ms  210.04 us    61.43 us    rwlock:W   release_task+0xe0
       63      3.96 ms  204.02 us    62.90 us    spinlock   release_task+0xe0
      115    477.15 us   19.69 us     4.15 us    spinlock   rcu_report_qs_rdp+0x40
      250    437.34 us    5.34 us     1.75 us    spinlock   raw_spin_rq_lock_nested+0x24
       32    156.32 us   13.56 us     4.88 us    spinlock   cgroup_exit+0x34
       19     88.12 us   12.20 us     4.64 us    spinlock   exit_fs+0x44
       12     23.25 us    3.09 us     1.94 us    spinlock   lock_hrtimer_base+0x4c
        1     18.83 us   18.83 us    18.83 us    rwsem:R    btrfs_tree_read_lock_nested+0x38
        1     17.84 us   17.84 us    17.84 us    rwsem:W    btrfs_tree_lock_nested+0x38
       10     15.75 us    5.72 us     1.58 us    spinlock   raw_spin_rq_lock_nested+0x24
        5     15.08 us    5.59 us     3.02 us    spinlock   mix_interrupt_randomness+0xb4
        2     12.78 us    9.50 us     4.26 us    spinlock   raw_spin_rq_lock_nested+0x24
        1     11.13 us   11.13 us    11.13 us    spinlock   __queue_work+0x338
        3     10.79 us    7.04 us     3.60 us    spinlock   raw_spin_rq_lock_nested+0x24
        3      8.17 us    4.58 us     2.72 us    spinlock   raw_spin_rq_lock_nested+0x24
        3      7.99 us    3.13 us     2.66 us    spinlock   lock_hrtimer_base+0x4c
        2      6.66 us    4.57 us     3.33 us    spinlock   free_pcppages_bulk+0x50
        3      5.34 us    2.19 us     1.78 us    spinlock   ibmvscsi_handle_crq+0x1e4
        2      3.71 us    2.32 us     1.85 us    spinlock   __hrtimer_run_queues+0x1b8
        2      2.98 us    2.19 us     1.49 us    spinlock   raw_spin_rq_lock_nested+0x24
        1      2.85 us    2.85 us     2.85 us    spinlock   raw_spin_rq_lock_nested+0x24
        2      2.15 us    1.09 us     1.07 us    spinlock   raw_spin_rq_lock_nested+0x24
        2      2.06 us    1.06 us     1.03 us    spinlock   raw_spin_rq_lock_nested+0x24
        1      1.69 us    1.69 us     1.69 us    spinlock   raw_spin_rq_lock_nested+0x24
        1      1.53 us    1.53 us     1.53 us    spinlock   __queue_work+0xd8
        1      1.27 us    1.27 us     1.27 us    spinlock   pull_rt_task+0xa0
        1      1.16 us    1.16 us     1.16 us    spinlock   raw_spin_rq_lock_nested+0x24
        1       740 ns     740 ns      740 ns    spinlock   add_device_randomness+0x5c
        1       566 ns     566 ns      566 ns    spinlock   raw_spin_rq_lock_nested+0x24
From the results, we were able to observe lock contention specifically
on spinlocks.
The patch works as expected.
Thank you for the patch!
Tested-by: Samir Mulani <sa...@linux.ibm.com>