<http://www.squid-cache.org/bugs/show_bug.cgi?id=2376>

When a peer goes down and then comes back, its round-robin counters aren't reset, causing it to get a disproportionate amount of traffic until it "catches
up" with the rest of the peers in the round-robin pool.

If it was down for load-related reasons, this makes it more likely that it will go down again, because it is temporarily handling the load of the entire pool.

Normally this isn't a concern, because the number of requests a peer can get out of step by is relatively small (bounded by how many requests it can be given before it is considered down -- is this 10 in all cases, or are there corner
cases?). But in an accelerator setup where the origin has a process-based
request-handling model, or where back-end processes are CPU-intensive, it is.

It looks like the way to fix this is to call peerClearRR from neighborAlive in neighbors.c. However, peerClearRR currently clears only the single peer it is
given - it's necessary to clear *all* peers simultaneously.

Therefore, I suggest:

1) calling peerClearRR from neighborAlive

2) changing the semantics of peerClearRR to clear all neighbours at once, and
updating its callers accordingly. A rough sketch of both changes follows.
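
Sketch only -- an outline of the two changes, not a tested patch. It assumes
Config.peers is the head of the configured-peer list and rr_count is the
per-peer counter the round-robin selector compares; check both against the
tree being patched.

/* (2) peerClearRR drops its single-peer argument and resets every
 * configured peer, so the whole pool restarts from an even footing. */
void
peerClearRR(void)
{
    peer *p;
    for (p = Config.peers; p != NULL; p = p->next)
        p->rr_count = 0;
}

/* (1) In neighborAlive() (neighbors.c), on the path where a previously
 * dead peer is detected as revived, rebalance the pool: */
    /* ... existing revival bookkeeping ... */
    peerClearRR();

The point of clearing every peer rather than just the revived one is that the
pool only balances if all the counters start again from the same value.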


--
Mark Nottingham       [EMAIL PROTECTED]

