On Mon, 15 Aug 2011 23:17:55 +0200, Henrik Nordström wrote:
> Mon 2011-08-15 at 09:50 -0600, Alex Rousskov wrote:
>> I do not like aborted retrievals as the default method of handling a
>> digest-based hit. Aborted transactions have negative side-effects, and
>> some of those effects are not controlled by Squid (e.g., monitoring
>> software may trigger an alert if too many requests are aborted).
>>
>> I agree that we can switch from entities to instances, provided we are
>> OK with excluding 206, 302, and similar non-200 responses from the
>> optimization. By the instance definition, Squid would not be able to
>> compute or use an instance digest if the response is not 200 OK. We
>> can hope that the vast majority of non-200 responses are either not
>> cacheable or are very small and not worth optimizing.
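[Editor's note: a minimal sketch, not Squid code, of the instance digest being discussed, in the RFC 3230 `Digest` header format. It assumes SHA-256 and illustrates why only a complete 200 OK body qualifies: a 206 partial body would hash to something different from the instance it was cut from.]

```python
import base64
import hashlib

def instance_digest(body: bytes, algorithm: str = "sha-256") -> str:
    """Compute an RFC 3230-style Digest header value over a full instance.

    Only meaningful for a complete 200 OK entity-body: hashing a 206
    partial body (or a 302 body) would not identify the instance.
    """
    h = hashlib.new(algorithm.replace("-", ""))  # "sha-256" -> "sha256"
    h.update(body)
    b64 = base64.b64encode(h.digest()).decode("ascii")
    return "%s=%s" % (algorithm.upper(), b64)

print(instance_digest(b"hello world"))
```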
> The bulk bandwidth where you would find duplicates is in positive GET
> responses.
>
> Not being able to support 206 duplicate detection without caching the
> full 200 in the "topmost" cache is a little annoying, however.
>>> In requests you can optionally add a digest-based condition similar
>>> to If-None-Match, but here If-None-Match already serves the purpose
>>> quite well, so use of the digest condition should probably be
>>> limited to cases where there is no ETag.
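[Editor's note: a sketch of the condition logic proposed here. "If-None-Digest" is a made-up header name for illustration only, not a real HTTP header; per the discussion, the digest condition only comes into play when the response has no ETag.]

```python
from typing import Dict, Optional

def needs_body(req: Dict[str, str], etag: Optional[str], digest: str) -> bool:
    """Decide whether the response body must be transmitted.

    "If-None-Digest" is hypothetical; If-None-Match is the real header
    and, where an ETag exists, already serves the purpose quite well.
    """
    if etag is not None:
        # Standard conditional: skip the body only on an ETag match.
        tags = [t.strip() for t in req.get("If-None-Match", "").split(",")]
        return etag not in tags
    if "If-None-Digest" in req:
        # Proposed digest-based condition for ETag-less responses.
        return req["If-None-Digest"] != digest
    return True  # no condition present: send the body
```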
>> Or to cases where the ETag lies about response content changes.
> True, but I kind of doubt there is much bandwidth to be found in those
> cases.
>>> To reduce bandwidth lost to unneeded transmission, a slow-start
>>> mechanism can be used where the sending side waits a couple of RTTs
>>> before starting to transmit the body of a large response for which
>>> an instance digest is presented. This allows the receiving end to
>>> check the received instance digest and abort the request if it is
>>> not interested in receiving the body.
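[Editor's note: a toy sender-side sketch of the slow-start idea, under stated assumptions (`send` transmits bytes, `client_aborted` polls for an abort); it is not Squid's implementation. Headers carrying the instance digest go out immediately; the large body is held back for a couple of RTTs so the receiver can inspect the digest and abort first.]

```python
import time

class AbortedByClient(Exception):
    """Raised when the receiver gives up after inspecting the digest."""

def send_with_slow_start(send, client_aborted, headers, body, rtt=0.05):
    send(headers)            # headers include the instance digest
    time.sleep(2 * rtt)      # "a couple of RTTs" of grace time
    if client_aborted():
        raise AbortedByClient  # large body never transmitted: bytes saved
    send(body)
```

The bandwidth saving only materializes for responses much larger than the headers, which is why the proposal restricts the delay to large responses.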
>> Besides my general dislike for aborted transactions becoming the norm
>> (see above), a "couple of RTTs" delay is a high price to pay, because
>> each RTT can already be a few seconds.
> Seconds? What kind of network is this?
Satellite (long distance), submarine radio (long wave, low bitrate), or
ad-hoc ground relay (multiple long-distance IP hops).

The RTT details on the latter two are mostly classified, but GEO-synchronous
satellites are publicly documented. A single ground-satellite-ground loop
can have close to 1 sec RTT at the IP level. With complications such as
triangular routing over a ground-to-ground uplink, that only gets worse.
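[Editor's note: a back-of-the-envelope check of the ~1 sec figure. Light-speed propagation alone accounts for roughly half of it; queuing, coding, and processing make up the rest. This assumes the satellite is directly overhead, so it is a lower bound (slant paths to non-equatorial stations are longer).]

```python
C_KM_PER_S = 299_792.458   # speed of light in vacuum, km/s
GEO_ALT_KM = 35_786        # geostationary altitude above the equator, km

# A request/response exchange crosses the ground-satellite gap four
# times: up and down, once in each direction.
propagation_rtt = 4 * GEO_ALT_KM / C_KM_PER_S
print(f"{propagation_rtt:.2f} s")  # ~0.48 s before any queuing/processing
```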
Also, satellites with routers aboard were due to go up sometime over the
last year, so there may now be ground-satellite-satellite-ground loops as
well. I'm not sure what the real numbers are there, but in the early days
figures like 2-3 seconds RTT were being discussed, mostly due to low-power
requirements, send/receive context switching (!!), or buffer bloat in
queues sized to cope with the bitrates. So it is reasonable to expect at
least some links with really crap performance.
Amos