Re: [6man] New Version Notification for draft-nordmark-6man-impatient-nud-00.txt

Ray Hunter Wed, 25 May 2011 10:59:08 -0700

Thanks once again for replying

I think the differences are indeed down to bad implementations ratherthan specification.

Although it is acknowledged in RFC5461 section 4.1 as late as 2009 thatthere are non-compliant implementations in the field where TCP doesreact to soft errors. Some of the kit I work on is a lot older than that.

Also my operational experience of ARP says that dead gateway protectionis not widely supported, and black holing was more the norm than theexception.


Erik Nordmark wrote:

I don't understand why you think all the nodes on a link need to becoordinated in such a way; the Internet protocols are designed to berobust and not assume that all the nodes have the same code andtuning. For instance, we don't require that ARP or TCP on all thenodes on a link to have the same timer values, and things work just fine.
Erik

And if all nodes on the link aren't behaving the same way, don't you
still get say 50% of the multicasts as the partner nodes revert to the
"-" state by timing out "too fast" for that link type?

Just seems like another reason to have this as a "per link" parameter
rather than a "per node" parameter.

Best regards,

RayH

The last point was simply one from an operational perspective. Forgiveme for being such a low level guy.


[side track]

One of my grumbles about IPv6 is that network managers just don't havethe standard/generic tools to be able to tune the behavior of end nodeseffectively. There are quite a lot of host behaviors that are set withlocal preferences and have default values, but which are not coordinatedacross implementations. e.g. dare I mention SLAAC v. DHCPv6.

As a network admin that's just a nightmare to manage in an environmentwhere there are multiple operating systems, guest end nodes, travelingusers, new nodes, old implementations...... half of the implementationsare performing in a way that isn't suitable for your network, but youmight not have admin rights on that end node, and there's no way toprovide the end node with a hint of correct behavior.

Think of a network using "Bring Your Own Device" policy where you do nothave any admin control e.g. no Active Directory.

There's seems to be no (effective) way of network equipment being ableto signal to end nodes what is appropriate behavior for your particularnetwork, compared to the simple existing tools like DHCPv4 options +extensions we are already have today. I'm sure certain SLAAC evangelistswill tell me it's no business of mine to try to manage this at all, andself-configuration is the future. But never mind.

[/side track]

I've just read the RFC covering the (very interesting) mesh under /route over mechanism used in 6LoWPANhttp://tools.ietf.org/html/draft-ietf-6lowpan-nd-16 . Very cool stuff.

Even there it was a requirement that all nodes taking part in thenetwork behave the same way.> The applicability of this specification is limited to LoWPANs whereall nodes on the subnet implement these optimizations in a homogeneous way.

So if the point of this draft is really to limit multicast, then from anoperational perspective don't you want ALL nodes on a link to avoidusing multicast as much as possible?

So if the point of this draft is really to avoid operational problemswith STP thrashing, then from an operational perspective don't you wantALL nodes on a link to avoid timing out too fast as much as possible?

And how do the end nodes know what is appropriate operational behavioron this particular link? Out of scope of the draft ........ ?

If that's really true that the end nodes do not have to behave the sameway then I do not understand why the Reachable Time and the RetransmitTimer are sent in an RA message.

Put it the other way around way: I don't understand then why it wasconsidered so important that all nodes used the same values forReachable Time and Retransmit Timer (for NUD), if it now isn'tconsidered important that they even use the same retry mechanism in theprobe state, or for how long that state can last.

That's all I'm saying. If you perform link-level coordination for oneset of parameters used by NUD, why not this particular one?

Also for debugging, it's just one more thing to look at on that sniffertrace when spending a weekend / evening debugging in a data centre (notmy favorite hobby and something I try to avoid). So is a node notresponding because it is using exponential NUD back off, or is it notresponding because a ND message is being dropped due to spanning treethrashing around, or is it not responding because the end nodeimplementation is plain broken?

Hope this helps clarify where I'm coming from. It's not in any way acriticism of your draft, just a potential pointer to how it could be"improved" from the perspective of someone operational.


regards,
RayH

--------------------------------------------------------------------
IETF IPv6 working group mailing list
[email protected]
Administrative Requests: https://www.ietf.org/mailman/listinfo/ipv6
--------------------------------------------------------------------

Re: [6man] New Version Notification for draft-nordmark-6man-impatient-nud-00.txt

Reply via email to