On Mon, 2017-07-31 at 10:18 -0700, Shaohua Li wrote:
> From: Shaohua Li <[email protected]>
>
> In a syn flooding test, the fib6_table rwlock is a significant
> bottleneck. While converting the rwlock to rcu sounds straighforward,
> but is very challenging if it's possible. A percpu spinlock is quite
> trival for this problem since updating the routing table is a rare
> event. In my test, the server receives around 1.5 Mpps in syn flooding
> test without the patch in a dual sockets and 56-CPU system. With the
> patch, the server receives around 3.8Mpps, and perf report doesn't show
> the locking issue.
>
> +static inline void fib6_table_write_lock_bh(struct fib6_table *table)
> +{
> + int i;
> +
> + spin_lock_bh(per_cpu_ptr(table->percpu_tb6_lock, 0));
> + for_each_possible_cpu(i) {
> + if (i == 0)
> + continue;
> + spin_lock_nest_lock(per_cpu_ptr(table->percpu_tb6_lock, i),
> + per_cpu_ptr(table->percpu_tb6_lock, 0));
> + }
> +}
Your code assumes that cpu 0 is valid.
I would rather not hard code this knowledge.
Also this is not clear why you need the nested thing.