On 9/14/2012 1:17 PM, Paul Graydon wrote:
> On 09/14/2012 05:37 AM, Brent Chapman wrote:
>> Aggregated network links involving multiple parallel circuits
>> generally use some sort of hash on the 5-tuple (src ip, dest ip,
>> protocol, src port, dest port) so that packets for a given TCP or UDP
>> session all get sent down the same parallel circuit; this is an easy
>> way to help ensure that the packets don't get re-ordered, which many
>> protocols are sensitive to. However, if the particular "same
>> parallel circuit" that they get sent down is broken, as appears to
>> have been the case here, you can wind up with behavior like what you
>> saw: certain sessions (that happen to get hashed down those broken
>> circuits) break horribly, while others (that get hashed down
>> non-broken circuits) are just fine.
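
To make the flow-pinning concrete, here's a toy sketch of per-flow
link selection. The hash function and link count are made up for
illustration (real gear uses its own, often proprietary, hash), but
the behavior is the same:

    # Toy per-flow link selection for a 4-link aggregate. SHA-1 and
    # num_links=4 are illustrative choices, not what any switch
    # actually uses; the point is that every packet of one flow maps
    # to the same member link.
    import hashlib

    def pick_link(src_ip, dst_ip, proto, src_port, dst_port, num_links=4):
        key = f"{src_ip}|{dst_ip}|{proto}|{src_port}|{dst_port}".encode()
        digest = hashlib.sha1(key).digest()
        return int.from_bytes(digest[:4], "big") % num_links

    # Two sessions differing only by source port may hash to different
    # links -- so one session can cross a broken link and die horribly
    # while its neighbor is perfectly healthy.
    print(pick_link("10.0.0.1", "10.0.0.2", "tcp", 49152, 443))
    print(pick_link("10.0.0.1", "10.0.0.2", "tcp", 49153, 443))
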
> I've never really delved into the networking aspects of aggregation;
> it's never been something I've had any need to utilise, so forgive me
> if these are stupid questions.
>
> In circumstances where a port goes down entirely, would the link
> aggregation generally be fine? The system would presumably be smart
> enough to identify that the port isn't working and stop routing
> traffic that way? I'm assuming the failure in this case is that the
> packet loss was too slight to trigger the aggregation's failure
> detection, but disruptive enough to mess things up?

Yes, link aggregation provides redundancy/failover and increased
bandwidth all in one. It's active-active. Worst case it provides
redundancy against physical link issues (cable cut or other failure),
and best case with chassis-based or stacked switches you can have
complete redundancy against even switch or linecard failures. It's a
very mature technology.

> Why would this disrupt TCP's delivery guarantees (admittedly I'm
> assuming the application traffic was TCP and not UDP)? Presumably the
> packet would fail to reach the other side, so the sender would resend
> after failing to get an ACK?

When you lose a TCP packet, the congestion window (which effectively
caps bandwidth, and is measured in MSS-sized segments) is
automatically cut in half (multiplicative backoff). But the climb back
up to full speed is very gradual, only about one MSS per round trip
(additive increase). Therefore small amounts of loss can have drastic
effects on throughput.
See: http://en.wikipedia.org/wiki/TCP_congestion_avoidance_algorithm
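
To see the shape of it, here's a toy AIMD model (parameters made up;
this ignores slow start, timeouts, and everything else a real stack
does):

    # Toy AIMD: window in MSS units grows by 1 each RTT (additive
    # increase) and halves at each loss event (multiplicative
    # decrease). Starting window and loss timing are illustrative.
    def simulate(num_rtts, loss_rtts, cwnd=40.0):
        history = []
        for rtt in range(num_rtts):
            if rtt in loss_rtts:
                cwnd = max(cwnd / 2.0, 1.0)  # back off on loss
            else:
                cwnd += 1.0                  # creep back up, 1 MSS/RTT
            history.append(cwnd)
        return history

    # One loss event halves the window, and the climb back costs one
    # RTT per MSS lost -- here 25 RTTs of reduced throughput.
    print(simulate(40, {10}))
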
Here's an old ACM paper on TCP loss algorithms and what happens as loss
approaches 0.1% (much terribleness -- see Figure 4):
http://ccr.sigcomm.org/archive/1997/jul97/ccr-9707-mathis.pdf
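
The handy rule of thumb from that paper (the "Mathis equation") is
that steady-state throughput is bounded by roughly (MSS/RTT) *
C/sqrt(p), with C around sqrt(3/2). Plugging in some numbers of my own
(illustrative, not from the paper) shows why fractional-percent loss
hurts so much:

    # Mathis et al. bound: BW <= (MSS/RTT) * C / sqrt(p), in bits/sec.
    # The MSS, RTT, and loss rate below are my own example values.
    import math

    def mathis_bw_bps(mss_bytes, rtt_s, loss_rate, C=math.sqrt(1.5)):
        return (mss_bytes * 8 / rtt_s) * C / math.sqrt(loss_rate)

    # 1460-byte MSS, 50 ms RTT, 0.1% loss: TCP tops out near 9 Mbit/s,
    # no matter how big the pipe underneath is.
    print(mathis_bw_bps(1460, 0.050, 0.001) / 1e6, "Mbit/s")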