From: Ashutosh Kaushik <[email protected]>
The patch fixes the issues with check of global flag "rps_needed" in RX Path
(which process packets in TCP/IP stack like netif_rx and netif_receive_skb
functions)
These functions have flag CONFIG RPS which is enabled default in kernel and to
enter in RPS mode, it depends on variable rps_needed.
This variable is updated whenever value in
/sys/class/net/<device>/queues/rx-0/rps_cpus is being changed.
There are 2 scenarios where it is executing extra piece of code even when
results would be same every time:-
1) Suppose in system more than one networking devices are connected i.e. wired
(eth0) and wireless (wlan0).
If I enable RPS using above method for wlan0, then it sets atomic variable
in rps_needed flag which is global.
Now,whenever traffic uses wired network, then it will execute that extra
piece of code in RX path because of only dependency on global rps_needed
variable as below:
#ifdef CONFIG_RPS
if (static_key_false(&rps_needed)) {
struct rps_dev_flow voidflow, *rflow = &voidflow;
int cpu = get_rps_cpu(skb->dev, skb, &rflow);
if (cpu >= 0) {
ret = enqueue_to_backlog(skb, cpu, &rflow->last_qtail);
rcu_read_unlock();
return ret;
}
}
#endif
while every time, value returned from get_rps_cpu will be < 0 as
rps_map value is not set for this network device using
/sys/class/net/device/queues/rx-0/rps_cpus.
And it will every time execute get_rps_cpu function with fail case in
IF condition which will be an extra overhead for packet processing.
2) Another scenario is as: Suppose that we have enable RPS for wireless device
say wlan0 using above specified method which will set value of rps_needed.
After doing test with set value of rps_cpus in sysfs, we do rmmod driver of
wireless device.
Next time again when we do insmod without rebooting system, it always hit
below code with fail case:
#ifdef CONFIG_RPS
if (static_key_false(&rps_needed)) {
struct rps_dev_flow voidflow, *rflow = &voidflow;
int cpu;
preempt_disable();
rcu_read_lock();
cpu = get_rps_cpu(skb->dev, skb, &rflow);
if (cpu < 0)
cpu = smp_processor_id();
ret = enqueue_to_backlog(skb, cpu, &rflow->last_qtail);
rcu_read_unlock();
preempt_enable();
} else
#endif
The reason behind this overhead of hitting this code with false case is same
because before doing rmmod, we enabled RPS which set rps_needed flag.
Next time when we do insmod, it will just create entry for network device in
sysfs with default value which is 0 for rps_cpus. It implies that RPS is
disable for that device.
But due to unchanged value of rps_needed variable, it goes into IF condition
every time (which is failed in get_rps_cpu function) even rps_cpus is 0.
Because if we do not enable RPS for that network device, rps_map is not set.
The patch adds a check to these two RX functions which will check RPS
availability locally for device specific.
Signed-off-by: Ashutosh Kaushik <[email protected]>
---
net/core/dev.c | 8 ++++++--
1 file changed, 6 insertions(+), 2 deletions(-)
diff --git a/net/core/dev.c b/net/core/dev.c
index 5df6cbc..1aa4402 100644
--- a/net/core/dev.c
+++ b/net/core/dev.c
@@ -3531,12 +3531,14 @@ drop:
static int netif_rx_internal(struct sk_buff *skb)
{
int ret;
+ struct netdev_rx_queue *dev_rxqueue = skb->dev->_rx;
net_timestamp_check(netdev_tstamp_prequeue, skb);
trace_netif_rx(skb);
#ifdef CONFIG_RPS
- if (static_key_false(&rps_needed)) {
+ if (static_key_false(&rps_needed) &&
+ dev_rxqueue->rps_map && dev_rxqueue->rps_map->len) {
struct rps_dev_flow voidflow, *rflow = &voidflow;
int cpu;
@@ -3986,6 +3988,7 @@ static int __netif_receive_skb(struct sk_buff *skb)
static int netif_receive_skb_internal(struct sk_buff *skb)
{
int ret;
+ struct netdev_rx_queue *dev_rxqueue = skb->dev->_rx;
net_timestamp_check(netdev_tstamp_prequeue, skb);
@@ -3995,7 +3998,8 @@ static int netif_receive_skb_internal(struct sk_buff *skb)
rcu_read_lock();
#ifdef CONFIG_RPS
- if (static_key_false(&rps_needed)) {
+ if (static_key_false(&rps_needed) &&
+ dev_rxqueue->rps_map && dev_rxqueue->rps_map->len) {
struct rps_dev_flow voidflow, *rflow = &voidflow;
int cpu = get_rps_cpu(skb->dev, skb, &rflow);
--
1.9.1
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/