batch netlink messages - performance improvement

Yigal Reiss (yreiss) Thu, 25 Feb 2016 11:54:02 -0800

Hi,

I would like to check an idea.


I am using nfqueue for DPI in user space. I use the already existing batch 
verdict from user space. The problem with that is that reducing the number of 
user <--> kernel context switches is bound to 1/2, since kernel --> user space 
still reports every single packet. So if I have batch verdict for every 25 or 
50 packets, then still I only reduced the number of switches by an order of 2. 

So I tried batching the unicast netlink messages (carrying the packets) from 
kernel to user space. I do that by calling sk->sk_data_ready(sk); (in 
__netlink_sendskb() in af_netlink.c) only every [N] packets. This seems to 
contribute similar performance improvements as the batch verdict.

However I have no experience in kernel programming and currently I only 
implemented a quick and dirty hack (no timeout, assuming a single socket...) 
just to demonstrate the improvement. My question is therefore whether such an 
improvement could be interesting for the main kernel. Does it bear any problems 
etc.

If this suggestion makes sense, how would you suggest proceed with this idea? I 
could continue and start working on a patch, but since as I wrote I have no 
experience in kernel programming I would like to have some thumbs up for the 
directions I'm taking, what makes sense and what's not etc so I don't waste my 
and other people time.

B.t.w., I saw that there is another potential improvement which is mmaping the 
packets to user space. I couldn't figure out whether this feature is complete 
in any kernel version and is it ready to use. 

Thanks,
Yigal

--
To unsubscribe from this list: send the line "unsubscribe netfilter-devel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

batch netlink messages - performance improvement

Reply via email to