Re: [dns-privacy] WGLC : draft-ietf-dprive-unilateral-probing

George (Yorgos) Thessalonikefs Fri, 07 Apr 2023 05:56:32 -0700

Hi all,

On 27/03/2023 10:24, Stephane Bortzmeyer wrote:

* Unbound implementation is not ready, but I let Yorgos elaborate on
this point.

The Unbound implementation is far from ready but the hackathon time waswell spent to identify needed changes to Unbound to cleanly supportunilateral probing and to look closely at the draft.

I will continue with the development in the future and report back herewith the results. Some initial notes for that if you are interested:

- The feature is going to be off by default;
- When turned on, the default further probing configuration will be to
  actively probe new servers in an attempt to ease testing;
- Retaining data across reset as per section 4.5 will not be included,
  at least in the initial implementation.

Now on with my comments for the draft, sorry for the wall of text :)


## A - ALPN

In section 4.4 there is mention of ALPN for the resolver (a MUST if Iread it correctly) but there is no mention of ALPN for the authoritativeside in the document.



## B - Resolver source IP

Section 4.5.1 describes keeping state based on the resolver's own sourceIP. This is to support the guidance from section 3.1 where it says:


      To avoid incurring additional minor timeouts for such a recursive
      resolver, the pool operator SHOULD either:
      * ensure that all members of the pool enable the same encrypted
        transport(s) within the span of a few seconds, or
      * ensure that the load balancer maps client requests to pool
        members based on client IP addresses.

My interpretation of this text is that the first bullet point is foroffering the same transport service with a slight hiccup during update,whereas the second bullet point is for offering different transportservices on individual servers of the pool.

The worst case for the former is that the pool is going to be labeled assupporting encryption at most 1 day (damping variable) later, based onwhich servers are reached from the pool.This looks fine for me and no extra state keeping (i.e., resolver ownsource IP) is needed.

I find trying to keep extra state per resolver source IP for the lattercase particularly challenging. Especially if the resolver is notconfigured with explicit outgoing interfaces, thus default route, andneeds to observe its own source address from the reply, which may not beavailable next time around thus giving bind()/send() errors andintroducing retry code paths.All this while the measure does not guarantee to solve thedifferent-transport-service-behind-a-single-IP case as it dependsheavily on the network.I understand that partial rollout is meant to test the waters for anauthoritative operator but I believe using a separate IP for enablingDoT and/or DoQ for testing would make things simpler for both sides.

I don't have an operator's hat but is a pool with variable transportservices something that we actively want to support?



## C - Failure identification
There is mention in the draft about successful and unsuccessful DNS replies.
SERVFAIL is used as an example of an unsuccessful DNS reply.

Following the pseudo code in the draft, a SERVFAIL answer in all thetransports, which IMHO is an already usable DNS answer for the resolver,will make the resolver to wait for all the transport replies beforeconsidering using the SERVFAIL as the final answer.

My opinion is that any RCODE in the reply is a successful DNS answer (ofcourse with matching ID, qname, etc).Otherwise we introduce something like a healthcheck per transport, seewhich transport replies "better" and use that.I believe this aligns with Stephane's observation during the hackathonabout different answers on 53 and 853 and needs addressing in section 3to clearly state that a nameserver's reply to a given query must be thesame regardless of the transport used (maybe not the best text if TC isalso to be considered but I hope I get my message across :)

Maybe also define an unsuccessful "reply" as timeout/connection shutdowninstead of non-preferable RCODEs? There is already logic in resolvers tohandle different RCODEs.

What I am trying to say is to not base the usability of the encryptedtransport on the DNS replies themselves. IMHO as long as there are DNSreplies there, the encrypted transport is usable and preferable.



## D - Wording knit
In sections 4.6.2 and 4.6.9 the following is said:

     If R is successful:
     - Return R to the requesting client

It may well be the case that the R is to an internal query and there isno requesting client waiting for an answer. Would the following work better?


     If R is successful:
     - R is further processed by the resolver


## E - Possible bug

In sections 4.6.2 and 4.6.9 the following is said after receiving asuccessful reply:


    - If Q is in N-queries[X]:
      - Remove Q from N-queries[X]

I believe this is a bug and needs to be removed since future, slowerreplies from the N transport will not be allowed to update the relevantmetrics as section 4.6.9 will stop further processing by the following text:


    If Q is not in E-queries[X]:
    - Discard R and process it no further (do not respond to a encrypted
      response to a query that is not outstanding)

In general I support the idea of the draft but I believe we need to ironout the expectations on both sides, also regarding Florian's recentcomments about per zone answers and thread-intelligence systems behavior.


Thanks for considering and best regards,
-- Yorgos


Some questions were raised about the draft, giving the experience with
PowerDNS Recursor:

* If the ADoT server replies but the reply indicates an error,
   such as SERVFAIL or REFUSED, should the resolver retries without
   DoT? PowerDNS recursor does it, but it seems it would make more
   sense to accept the reply, and just to remind system
   administrators that port 853 and 53 should deliver consistent
   answers. The draft seems clear on the first point (as long as
   there is a properly formatted DNS request, regard the server as
   DoT-enabled) but not on the second (no clear reminder for
   authoritative name servers).

* What should be the criteria to select an authoritative name

   server to query? Should we prefer a fast insecure server or a slow
   encrypted one? The draft does not mention it, because it is local
   policy. (PowerDNS recursor has apparently no way to change its
   default policy, which is to use the fastest one, DoT or
   not.) The draft does not mandate such a knob in the authoritative
   server, again, IETF typically does not tell endpoints how they have
   to be configured.



_______________________________________________
dns-privacy mailing list
dns-privacy@ietf.org
https://www.ietf.org/mailman/listinfo/dns-privacy

Re: [dns-privacy] WGLC : draft-ietf-dprive-unilateral-probing

Reply via email to