Re: [PATCH] Do not send unretriable requests on reused pinned connections

Amos Jeffries Fri, 30 Nov 2012 19:27:34 -0800

On 1/12/2012 1:31 p.m., Henrik Nordström wrote:

fre 2012-11-30 klockan 15:30 -0700 skrev Alex Rousskov:

     Squid is sending POST requests on reused pinned connections, and
some of those requests fail due to a pconn race, with no possibility for
a retry.

Yes... and we have to for NTLM, TPROXY and friends or they get in a bit
of trouble from connection state mismatch.

If sending the request fails we should propagate this to the client by
resetting the client connection and let the client retry.


It seems to me we are also forced to do this for ssl-bump connections.

* Opening a new connection is the wrong thing to do for server-first
  bumped connections, where the new connection MAY go to a completely
  different server than the one whose certificate was bumped with. We
  control the IP:port we connect to, but we cannot control the
  existence of IP-level load balancers.
* client-first bumped connections do not face the lag, *BUT* there is
  no way to identify them at forwarding time separately from
  server-first bumped.
* we are pretending to be a dumb relay - which offers the ironclad
  guarantee that the server at the other end is a single TCP endpoint
  (DNS uncertainty is only on the initial setup. Once connected packets
  reach *an* endpoint, they all do or the connection dies).

We can control the outgoing IP:port details, but have no control over
the existence of IP-level load balancers which can screw with the
destination server underneath us. Gambling on the destination not
changing on an HTTPS outbound when retrying for intercepted traffic will
re-open at least two CVE issues 3.2 is supposed to be immune to
(CVE-2009-0801 and CVE-2009-3555).

Races are also still very possible on server-bumped connections if for
any reason it takes longer to receive+parse+adapt+reparse the client
request than the server wants to wait for. Remember we have all the slow
trickle arrival of headers, parsing, adaptation, helpers and access
controls to work through before it gets to use the pinned server conn.
For example Squid is extremely likely to lose closure races on a mobile
network when some big event is on that everyone has to
google/twitter/facebook about while every request gets bumped and sent
through an ICAP filter (BBC at the London Olympics).

When using SslBump, the HTTP request is always forwarded using a server
connection "pinned" to the HTTP client connection. Squid does not reuse
a persistent connection from the idle pconn pool for bumped client
requests.

Ok.

  Squid uses the dedicated pinned server connection instead.
This bypasses pconn race controls even though Squid may be essentially
reusing an idle HTTP connection and, hence, may experience the same kind
of race conditions.

Yes..

However, connections that were just pinned, without sending any
requests, are not "essentially reused idle pconns" so we must be careful
to allow unretriable requests on freshly pinned connections.

A straight usage counter is definitely the wrong thing to use to
control this, whether or not you agree with us that re-trying outbound
connections is safe after guaranteeing the client (with encryption
certificate no less) that a single destination has been set up. What is
needed is a suitable length idle timeout and a close handler.
Both of which for bumped connections should trigger un-pinning and
abort the client connection. If the timeouts are not being set on
server-bump pinned connections then that is the bug and needs to be
fixed ASAP.
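To make the timeout-plus-close-handler idea concrete, here is a minimal
sketch as a toy model in Python. It is not Squid code: the class
PinnedConn, its fields and the 30 second limit are all hypothetical
names and values chosen for illustration. It only shows the behaviour
described above, where either an idle timeout or a server-side close
un-pins the connection and aborts the client connection.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PinnedConn:
    """Toy model of a server connection pinned to one client connection."""
    idle_timeout: float          # allowed silence in seconds (assumed value)
    last_activity: float = 0.0   # timestamp of the last packet on the socket
    pinned: bool = True
    client_events: List[str] = field(default_factory=list)

    def touch(self, now: float) -> None:
        """Record traffic on the server socket."""
        if self.pinned:
            self.last_activity = now

    def check_idle(self, now: float) -> None:
        """Idle timeout handler: fires once the silence reaches the limit."""
        if self.pinned and now - self.last_activity >= self.idle_timeout:
            self._unpin_and_abort("idle timeout")

    def on_server_close(self) -> None:
        """Close handler: the server (or a NAT in between) dropped the conn."""
        if self.pinned:
            self._unpin_and_abort("server closed")

    def _unpin_and_abort(self, reason: str) -> None:
        # Un-pin first, then inform the client by aborting its connection
        # instead of silently opening a new server connection.
        self.pinned = False
        self.client_events.append(f"abort client: {reason}")


conn = PinnedConn(idle_timeout=30.0)
conn.touch(now=5.0)
conn.check_idle(now=20.0)   # 15s of silence: still pinned
conn.check_idle(now=40.0)   # 35s of silence: un-pinned, client aborted
conn.on_server_close()      # already un-pinned: no second abort
```

Note that the model keeps no usage counter at all: the only inputs are
the time since the last packet and the close event.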

The issue is not that the conn was used then pooled versus pinned. The
issue is the async period between last and current packet on the socket
- we have no way to identify if the duration between has caused problems
(crtd, adaptation or ACL lag might be enough to die from some race with
NAT timeouts), whether that past use was the SSL exchange (server-bump
only) or a previous HTTP data packet. I agree this is just as much true
on bumped connections which were pinned at some unknown time earlier as
it is for connections pulled out of a shared pool and last used some
unknown time earlier. Regardless of how the persistence was done they
*are* essentially reused idle persistent connections. All the same
risks/problems, but whether retry or alternative connection setup is
possible differs greatly between the traffic types - with intercepted
traffic (of any source) the re-try is more dangerous than informing the
client with an aborted connection.
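The distinction between traffic types can be sketched as a small
decision function. This is one reading of the argument above written as
a toy Python model, not Squid's forwarding logic: the function name,
the Action enum and the four boolean parameters are hypothetical.

```python
from enum import Enum


class Action(Enum):
    RETRY_ON_NEW_CONNECTION = "retry"
    ABORT_CLIENT = "abort"


def on_reused_conn_failure(pinned: bool, intercepted: bool,
                           ssl_bumped: bool, retriable: bool) -> Action:
    """Pick a reaction when sending on a reused idle connection fails."""
    # Pinned, intercepted or bumped traffic: a new connection may reach a
    # different server, so inform the client by aborting its connection.
    if pinned or intercepted or ssl_bumped:
        return Action.ABORT_CLIENT
    # Ordinary pooled connection: a retriable request may be re-sent.
    if retriable:
        return Action.RETRY_ON_NEW_CONNECTION
    # Unretriable request (e.g. a POST): no safe retry is possible.
    return Action.ABORT_CLIENT


# A bumped request fails on its pinned connection: abort, never retry.
bumped = on_reused_conn_failure(pinned=True, intercepted=True,
                                ssl_bumped=True, retriable=True)
# A retriable request from the shared pool fails: retrying is acceptable.
pooled = on_reused_conn_failure(pinned=False, intercepted=False,
                                ssl_bumped=False, retriable=True)
```

The race risk is the same in every branch. What changes is only
whether an alternative connection is an acceptable answer to it.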

The same logic applies to pinned connection outside SslBump.

Which is quite likely the wrong thing to do. See above.

Regards
Henrik


Amos
