On Thu, 2005-08-12 at 17:57 +0100, Eric Dumazet wrote:

> > right, after i did this code, i realized that, and it is demonstrable
> > that #4 hurts, if only a little.
> > I'm sticking with my suggestion we go to #1,#2,#5
> 
> I would try another thing : #1,#2,#4bis
> 
> #4bis          prefetch(&next_skb->data);
> 
> instead of any combination of #4 or #5
> 
> #4             prefetch(next_skb);
> #5             prefetch(next_skb->data - NET_IP_ALIGN);
> 
> This way, the next time #1 is done (next loop), previous #4bis makes the 
> dereference hit the L1 cache : Prefetch should be more efficient.
> 

yep, I see a small difference with the above. The numbers went up by
about 1Kpps. 

Could the Robert/Jesse also verify this? I normally dont get excited by
an extra kpps these days;->

cheers,
jamal


-
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to