On 04/30/2011 03:18 PM, Richard Scheffenegger wrote:
I'm curious, has anyone done some simulations to check if the
following qualitative statement holds true, and if so, what the
quantitative effect is:
With bufferbloat, the TCP congestion control reaction is unduly
delayed. When it finally happens, the TCP stream is likely facing a
"burst loss" event - multiple consecutive packets get dropped. Worse
yet, the sender with the lowest RTT across the bottleneck will likely
start to retransmit while the (tail-drop) queue is still overflowing.
And a lost retransmission means a major setback in bandwidth (except
for Linux with bulk transfers and SACK enabled), as the standard
(RFC-documented) behaviour asks for an RTO (1 sec nominally, 200-500 ms
typically) to recover such a lost retransmission...
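To make the question concrete, here is roughly the kind of toy model I
have in mind: a slotted, per-RTT Python sketch of one TCP-like flow
feeding a drop-tail queue. Everything in it (no ack-clocking, the RTT
not growing as the queue fills, Reno-style halving, the parameter
values) is an arbitrary assumption of mine, not a measurement:

# Toy per-RTT model: one TCP-like flow into a drop-tail bottleneck queue.
# Purely illustrative; parameters and simplifications are assumptions.
def run(buffer_pkts, capacity_pkts_per_rtt=100, rounds=200):
    cwnd, ssthresh = 1.0, float("inf")
    queue = 0
    worst_burst, first_loss_round = 0, None
    for r in range(1, rounds + 1):
        # Net queue change this RTT: what arrives beyond what drains.
        excess = int(cwnd) - capacity_pkts_per_rtt
        if excess > 0:
            queue += excess
            dropped = max(0, queue - buffer_pkts)   # drop-tail overflow
            queue = min(queue, buffer_pkts)
        else:
            queue = max(0, queue + excess)
            dropped = 0
        worst_burst = max(worst_burst, dropped)
        if dropped:
            if first_loss_round is None:
                first_loss_round = r
            ssthresh = max(cwnd / 2, 2)
            cwnd = ssthresh                         # multiplicative decrease
        elif cwnd < ssthresh:
            cwnd *= 2                               # slow start
        else:
            cwnd += 1                               # congestion avoidance
    return first_loss_round, worst_burst

for buf in (20, 100, 500, 2000):                    # shallow vs. bloated buffer
    first, burst = run(buf)
    print(f"buffer={buf:5d} pkts  first loss at RTT {first}  "
          f"worst burst = {burst} pkts")

At least with these made-up parameters, the deeper the buffer, the
later the first loss comes and the larger the burst of packets dropped
when it finally does - which is exactly the qualitative behaviour I'd
like to see quantified properly (or refuted) with a real simulator or
real traces.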
The second part (actually more important as an incentive to the ISPs):
how does the fraction of goodput vs. throughput change when AQM
schemes are deployed and TCP CC reacts in a timely manner? Small ISPs
have to pay for their upstream volume, regardless of whether that is
"real" work (goodput) or unnecessary retransmissions.
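As a back-of-envelope (all numbers below are placeholders I made up,
and it assumes each lost data packet is retransmitted exactly once and
that the retransmission itself gets through): if a fraction p of the
bytes carried upstream are retransmissions, only about (1 - p) of the
volume the ISP pays for is goodput:

# Back-of-envelope only; placeholder numbers, not measurements.
monthly_upstream_tb = 200.0                 # assumed paid-for upstream volume
for retx_fraction in (0.01, 0.03, 0.09):    # assumed retransmitted byte fractions
    wasted_tb = monthly_upstream_tb * retx_fraction
    print(f"retransmitted fraction {retx_fraction:.0%}: "
          f"goodput/throughput ~ {1 - retx_fraction:.0%}, "
          f"~{wasted_tb:.0f} TB/month paid for but not 'real' work")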
When I was at a small cable ISP in Switzerland last week, sure enough
bufferbloat was readily observable (17 ms -> 220 ms after 30 sec of a
bulk transfer), but at first they had the "not our problem" view,
until I started discussing burst loss / retransmissions / goodput vs
throughput - with the last point being a real commercial incentive to
them. (They promised to check whether AQM would be available in the
CPE / CMTS, and to put latency bounds in their tenders going forward.)
I wish I had a good answer to your very good questions. Simulation
would be interesting, though real data is more convincing.
I haven't looked in detail at all that many traces to try to get a feel
for how much bandwidth waste there actually is, and more formal studies
like Netalyzr, SamKnows, or the Bismark project would be needed to
quantify the loss on the network as a whole.
I did spend some time last fall with the traces I've taken. In those,
I've typically been seeing 1-3% packet loss in the main TCP transfers.
On the wireless trace I took, I saw 9% loss, but whether that is
bufferbloat-induced loss or not, I don't know (the data is out there for
those who might want to dig). And as you note, the losses are
concentrated in bursts (probably due to the details of Cubic, so I'm told).
I've had anecdotal reports (and some first-hand experience) of much
higher loss rates, for example from Nick Weaver at ICSI; but I believe
in playing things conservatively with any numbers I quote and I've not
gotten consistent results when I've tried, so I just report what's in
the packet captures I did take.
A phenomenon that could be occurring is that during congestion avoidance
(until TCP loses its cookies entirely and probes for a higher operating
point), TCP is carefully timing its packets to keep the buffers almost
exactly full, so that competing flows (in my case, simple pings) are
likely to arrive just when there is no buffer space to accept them, and
therefore you see higher losses on them than you would on the single
flow I've been tracing and getting loss statistics from.
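One way to start putting a number on that guess (only a crude per-RTT
toy of my own, in the spirit of the sketch above, which misses all the
sub-RTT packet timing that probably matters most here): track how often
a drop-tail queue fed by a single bulk flow is completely full, as a
rough proxy for the chance that a randomly timed ping finds no free
slot, and compare that with the bulk flow's own per-packet loss rate.

# Crude per-RTT toy model; all parameters are assumptions, not measurements.
def probe_vs_bulk(buffer_pkts=1000, capacity=100, rounds=5000):
    cwnd, ssthresh = 1.0, float("inf")
    queue = 0
    sent = dropped_total = full_rounds = 0
    for _ in range(rounds):
        sent += int(cwnd)
        excess = int(cwnd) - capacity           # arrivals beyond one RTT's drain
        if excess > 0:
            queue += excess
            dropped = max(0, queue - buffer_pkts)
            queue = min(queue, buffer_pkts)
        else:
            queue = max(0, queue + excess)
            dropped = 0
        dropped_total += dropped
        if queue >= buffer_pkts:                # no room left for a competing packet
            full_rounds += 1
        if dropped:
            ssthresh = max(cwnd / 2, 2)
            cwnd = ssthresh
        elif cwnd < ssthresh:
            cwnd *= 2
        else:
            cwnd += 1
    return dropped_total / sent, full_rounds / rounds

bulk_loss, full_fraction = probe_vs_bulk()
print(f"bulk flow loss rate ~{bulk_loss:.2%}, "
      f"queue completely full in ~{full_fraction:.1%} of RTTs")

I wouldn't read much into the exact numbers from something this crude;
real packet traces (or a proper simulator) are the way to test it.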
People who want to look into this further would be a great help.
- Jim
_______________________________________________
Bloat mailing list
[email protected]
https://lists.bufferbloat.net/listinfo/bloat