Re: [6man] New Version Notification for draft-nordmark-6man-impatient-nud-00.txt

Ray Hunter Tue, 24 May 2011 11:33:31 -0700

Thanks very much for replying.

I think that I understand the motivation in that multicast is expensiveon some media, and that you thus want to avoid it.

I'm always prepared to be dazzled by my lack of knowledge or incorrectassumptions. I'd much rather ask a dumb question and get a smart answerthan just say nothing.

The idea of the draft seems to be that spending more time performingunicast neighbor solicitations in the "probe" state might avoid deletingthe neighbor entry, and thus the relearning the entry via "expensive"multicast NS from the "-" state.


Seems perfectly reasonable and something worth pursuing.

Erik Nordmark wrote:

Are you assuming that the routers inject host routes into the routingsystem based on the ND state? The routers inject a route for thesubnet prefix which isn't tied to the ND state in any way.

Yes, on the local link the router injects RA information based on a(statically configured or PD learned) prefix. But routers and otherdevices also redistribute reachability information elsewhere via otherprotocols.

The assumption that ND really is independent of everything else is whatI'm questioning myself, although I freely admit to a large dose of handwaving here. ND isn't like ARP, as you of course know.

An ARP cache entry would sit there silently for 4 hours by default anddo nothing, so packets could black hole if the next hop was learned viaa static route. Higher level protocols would have to detect the problemthemselves.

ND removing an entry by NUD probe failure retriggers next hopdetermination, and AFAIK also actively triggers replying to remote nodeswith an ICMP unreachable message, and so ND can thus can effectivelydisseminate reachability information far further than just the local link.

A later post I made gave an example of reachability info beingindirectly based on ND (via a BGP neighbor peering TCP session becomingunreachable due to an ICMPv6 unreachable, leading to route informationchanging). I can imagine the same for EIGRP, OSPF if their neighborsdisappear due to receipt of an ICMPv6 unreachable (althoughtraditionally these implementations have tended to ignore ICMP for goodreason).

Another example sort of device that sometimes transmist reachabilityinformation via TCP are WAN accelerators, that auto build networktunnels, and then send routing information across these. Again, anICMPv6 unreachable might cause the device to tear down the tunnel.

HSRP preference metrics can also potentially be influenced byreachability information (ND) from another link (via track commands).

Then there are also those dreaded silent devices (that we don't talkmuch about but which are generally plonked on the very most criticallink into the main data centre), such as network intrusion detectionsystems and firewalls, that actively monitor traffic across their links,but that don't take part in any official routing protocol exchanges, andcan fail over to a back up system without informing anyone else bymarking interfaces up and down.

Using the example of spanning tree, waiting for STP would probably meanwaiting 35 seconds (max_age + forwarding delay) in the default case forthe root bridge to send out topology notification BPDU's. That's a longtime in many protocols.

So I guess the question is also, do you want NUD to inform higher layersof the need for a fail over ASAP of a local link failure via ICMPunreachables (as Thomas seemed to suggest), or do you want ND to shut upand just keep on retrying locally and let those higher layer protocolshit their own time outs and take their own fail over actions?

Current ND seems to go the ASAP route with its 3 second timeout.Historically, ARP seems to go the silent route.

It just feels to me like all nodes on a common link should behave thesame way in this respect (no scientific argument, just raw gut feelingof deja vu, and impending packet storms)

And if all nodes on the link aren't behaving the same way, don't youstill get say 50% of the multicasts as the partner nodes revert to the"-" state by timing out "too fast" for that link type?

Just seems like another reason to have this as a "per link" parameterrather than a "per node" parameter.


Best regards,
RayH
--------------------------------------------------------------------
IETF IPv6 working group mailing list
[email protected]
Administrative Requests: https://www.ietf.org/mailman/listinfo/ipv6
--------------------------------------------------------------------

Re: [6man] New Version Notification for draft-nordmark-6man-impatient-nud-00.txt

Reply via email to