During the operations directorate early review of 
draft-ietf-rtgwg-segment-routing-ti-lfa 
Gyan Mishra points to a simple pathological network fragment that I think 
deserves wider discussion.

https://datatracker.ietf.org/doc/review-ietf-rtgwg-segment-routing-ti-lfa-11-opsdir-early-mishra-2023-08-25/

I am not aware of any response to the RTGWG by the draft authors concerning the 
review comment and I cannot see obvious new text addressing this concern.

The fragment is as follows

CE1 –R1- R2-/-R3-CE2
     |         |
     R4 – R5 -R6

In the pre converged network R4 is ECMP CE2 via R5 (cost 4) and via R1 (cost 
also 4).

We can easily build a TI-LFA repair path from R2 under link failure to CE2 (so 
long as we remember that R4 is an ECMP path to CE2), but the problem occurs 
during convergence. If R1 converges before R4, R4 may ECMP packets addressed to 
CE2 back to R1 in a micro loop. Meanwhile since no packets for R3 are reaching 
R2 the Ti-LFA repair is not doing anything useful. 

The Ti-LFA text leads the reader to conclude that it is a loop-free solution, 
but gives no guidance on how to determine when this assumption breaks down. 
There is an informational reference to 
draft-bashandy-rtgwg-segment-routing-uloop, but this short individual draft 
does little in the way of helping the reader determine when  loop avoidance 
strategy needs to be deployed and the loop-free approach it describes does not 
seem to be fully developed.

I am worried that proceeding with the Ti-LFA draft without noting that there is 
a real risk that simple network fragments can micoloop, and providing a fully 
formed mitigation strategy is a disservice to the operator community given the 
industry interest in Ti-LDA and the insidious nature of unexpected micro loop 
network transients, I am wondering what the view of the working group is on how 
to proceed.

One approach would be for the Ti-LFA draft to incorporate detailed guidance on 
how to determine the risk of a micro loop in a specific operator network, and 
to provide specific mitigation advice. Another approach would be to  reference 
a developed loop avoidance strategy and recommending its preemptive deployment. 
Another approach would be to make draft-bashandy-rtgwg-segment-routing-uloop a 
normative reference and tie the fate of the two drafts. Another approach would 
be to elaborate on the risks and their manifestations but declare it a 
currently unsolved problem. I am sure there are other options that the WG may 
formulate.

What is the opinion of the working group on how we should proceed with 
draft-ietf-rtgwg-segment-routing-ti-lfa when considering the possible formation 
of micro loops?

- Stewart
_______________________________________________
rtgwg mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/rtgwg

Reply via email to