I saw this 350 ms delay oddity about a year ago during tests, but I have not been able to reproduce the problem. At the time I was convinced that it was caused by ACKs occasionally being lost, in fact by the "ack every other packet" algorithm.

Lately, however, we've run RX tests again, worried about the pthreaded stack's performance, which is significantly worse than that of the LWP one.

. There are a few more places in the protocol that need a "dpf" macro in order to make the RX trace useful. A lock (...), the current thread ID in the output, and microsecond resolution in rx_debugPrint are a must for any serious work.
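To make the thread-ID and microsecond points concrete, a trace macro could look roughly like this. The macro name "dpf" matches the mail; the helper and its exact format are illustrative, not the real rx_debugPrint:

```c
#include <pthread.h>
#include <stdio.h>
#include <sys/time.h>

/* Sketch of a richer trace macro: a microsecond timestamp and the
 * calling thread's ID on every line.  Purely illustrative; the real
 * rx_debugPrint would be extended along these lines. */
static int trace_prefix(char *buf, size_t len)
{
    struct timeval tv;
    gettimeofday(&tv, NULL);
    return snprintf(buf, len, "%ld.%06ld [%lu] ",
                    (long)tv.tv_sec, (long)tv.tv_usec,
                    (unsigned long)pthread_self());
}

#define dpf(fmt, ...) do {                                   \
        char _buf[64];                                       \
        trace_prefix(_buf, sizeof(_buf));                    \
        fprintf(stderr, "%s" fmt "\n", _buf, ##__VA_ARGS__); \
    } while (0)
```

With microsecond resolution, two ACK events a few hundred microseconds apart become distinguishable in the trace, which is exactly what chasing sub-millisecond lock delays needs.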

. In order to counter performance drops due to high latency one might be tempted to increase the window sizes; however, given the way the (single) send queue is organized, this causes repeated traversals (in order to recalculate the timeouts, for example) to start taking macroscopic amounts of time under locks. I worked on this a little, with so far the only result being more timeouts... ;-)
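A toy model of why this hurts: each recalculation walks the whole send queue while holding the call lock, so the cost per event is O(W) for a window of W packets. The structs and names below are illustrative, not the real RX ones:

```c
#include <pthread.h>
#include <stddef.h>

/* Toy model: a single send queue whose retransmit timeouts are
 * recalculated by walking the entire list under the call lock.
 * Doubling the window doubles the time spent holding the lock. */
struct packet {
    struct packet *next;
    long retrans_at;            /* retransmit deadline, arbitrary units */
};

static pthread_mutex_t call_lock = PTHREAD_MUTEX_INITIALIZER;

/* Recompute every queued packet's timeout; returns packets touched. */
static int recalc_timeouts(struct packet *head, long now, long rtt)
{
    int n = 0;
    pthread_mutex_lock(&call_lock);
    for (struct packet *p = head; p; p = p->next) {
        p->retrans_at = now + rtt;   /* whole list rewritten under lock */
        n++;
    }
    pthread_mutex_unlock(&call_lock);
    return n;
}
```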

. The maximum window size is 255 (or 254...) due to the way the ACKs work.
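The limit falls out of the ACK wire format: the count of per-packet acknowledgement flags is carried in a single byte. A rough approximation of the layout (field names only loosely follow the real RX structure):

```c
/* Approximation of the RX ACK packet layout: the number of explicit
 * per-packet acknowledgements is a single byte, so one ACK can never
 * describe more than 255 outstanding packets -- hence the window-size
 * ceiling.  Illustrative, not the exact on-the-wire struct. */
struct ack_packet {
    unsigned int  first_packet;   /* sequence number of acks[0] */
    unsigned char n_acks;         /* 0..255: the hard limit */
    unsigned char acks[255];      /* one ACK/NACK flag per packet */
};
```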

. With bigger windows and a routed network, the 350 ms window for ACKs is actually low, and the price for retransmits is high. Here it makes sense to increase the timeout.

. Allocating new packets is done under a lock. As a result, incoming ACKs get processed late and contribute to keeping the queue size high. I introduced a "hint" in the call which causes the allocator to release and re-grab the lock between packets. That helped quite a lot.
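The "hint" idea can be sketched like this: when the caller signals it can tolerate the extra lock traffic, the allocator drops and re-acquires the free-packet lock between packets so the listener thread can get in and process ACKs. All names here are hypothetical:

```c
#include <pthread.h>
#include <stddef.h>

/* Sketch of a multi-packet allocator that optionally yields the lock
 * between packets (the "hint"), letting ACK processing interleave.
 * Illustrative only; not the actual RX allocator. */
static pthread_mutex_t freepkt_lock = PTHREAD_MUTEX_INITIALIZER;

struct packet { struct packet *next; };
static struct packet *free_list;

static struct packet *alloc_one_locked(void)
{
    struct packet *p = free_list;
    if (p)
        free_list = p->next;
    return p;
}

static int alloc_packets(struct packet **out, int want, int yield_hint)
{
    int got = 0;
    pthread_mutex_lock(&freepkt_lock);
    while (got < want) {
        struct packet *p = alloc_one_locked();
        if (!p)
            break;
        out[got++] = p;
        if (yield_hint && got < want) {
            /* Let a waiting thread (e.g. ACK processing) in
             * before grabbing the next packet. */
            pthread_mutex_unlock(&freepkt_lock);
            pthread_mutex_lock(&freepkt_lock);
        }
    }
    pthread_mutex_unlock(&freepkt_lock);
    return got;
}
```

The trade-off is more lock operations per allocation in exchange for shorter individual hold times, which is exactly what helps when the contending work (ACK processing) is latency-sensitive.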

. In the past free packets were queued instead of stacked, something which is counter-productive for the level-2 cache (for the headers). With the new allocation system this might be different; I haven't checked.
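The cache argument is the classic LIFO-vs-FIFO free-list one: a stack hands back the most recently freed packet, whose header is likely still warm in L2, while a queue hands out the coldest one. A minimal sketch (illustrative, not the RX code):

```c
#include <stddef.h>

/* LIFO free list: the packet just freed is the next one handed out,
 * so its header is likely still resident in the L2 cache.  A FIFO
 * queue would return the least recently touched (coldest) packet. */
struct packet { struct packet *next; };

static struct packet *free_stack;

static void pkt_free(struct packet *p)        /* push: most recent on top */
{
    p->next = free_stack;
    free_stack = p;
}

static struct packet *pkt_alloc(void)         /* pop: reuse the warm packet */
{
    struct packet *p = free_stack;
    if (p)
        free_stack = p->next;
    return p;
}
```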

. I'm currently trying to understand another puzzling case of ACKs being received but processed only about a millisecond later. Probably yet another locking problem.

I manage to fill the GigE interface at about 100-110 MB/s (megabytes) when the machines are on the same switch, versus the 50-60 you see when crossing a router. This is admittedly my own RX application, not rxperf.

Performance, however, drops dramatically once the sending end has to do something in addition, such as reading a disk. No double-buffering trick helps: if you're slow in producing data the send queue is empty, whereas if you're fast it's no better either, with sweet spots depending on window sizes of 16 / 32 / 48 packets. Again, I suspect the implementation of a single send/resend queue degrades once it fills up.
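For reference, the double-buffering scheme being dismissed here looks roughly like this: the reader fills one buffer while the sender drains the other, then the two swap roles. The structure and names are hypothetical; the mail's point is that no such arrangement saves you once the single send/resend queue fills:

```c
/* Minimal double-buffering sketch: producer and sender each own one
 * of two buffers and trade them on swap.  If filling is slower than
 * sending, the send queue runs dry anyway; if faster, it backs up.
 * Illustrative only. */
enum { DBUF_SZ = 4096 };

struct dbuf {
    char buf[2][DBUF_SZ];
    int  fill;                  /* index currently being produced into */
};

static void dbuf_swap(struct dbuf *d)
{
    d->fill ^= 1;               /* producer and sender trade buffers */
}

static char *dbuf_fill_slot(struct dbuf *d) { return d->buf[d->fill]; }
static char *dbuf_send_slot(struct dbuf *d) { return d->buf[d->fill ^ 1]; }
```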

--
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
Rainer Toebbicke
European Laboratory for Particle Physics(CERN) - Geneva, Switzerland
Phone: +41 22 767 8985       Fax: +41 22 767 7155
_______________________________________________
OpenAFS-devel mailing list
[email protected]
https://lists.openafs.org/mailman/listinfo/openafs-devel
