Hi Bruno, And best wishes all for this new year.
It seems that posting your txt back as attachment to the list does not work so I will put my comments on the text directly in the email : - In §2.2 : you keep the statement "The following ranges are used:" empty, may be it would be better to change with something like : "None of the ranges are used in this case" - §2.6 : It should be mentioned that this kind of behavior should never happen because the sender must have local configuration procedures to avoid this kind of behavior. But in case there is something going wrong and such advertisement is received, routers must have a consistent behavior. IMO, as the local implementation must avoid this kind of advertisement (and I think we need to state this somewhere), drop all sounds the best option because as this might be a bug, nothing says that the forwarding is correctly setup on the node sending the bad advertisement. - §3. : IGP LSDB may be too restrictive, as depending of the label management implementation, there might be cross protocol conflicts of SIDs. - §3.2.1 : [SLI] Maybe we should mention that this kind of conflict has a "limited" impact. On the LSR, we have more LFIB consumption but no traffic impact, as we are creating multiple parallel LSPs towards the same destination. At the iLSR side, there might be some churn in the FIB (IP to MPLS entry) as it will have to choose one or the other advertisement in order to program the entry. - §3.2.2 : [SLI] As previous comment, we may need to mention the impact of such conflict : inconsistent routing (even possible loops) if there is disagreement between routers (LSRs). - §3.3 : [SLI] We must mention, that traffic is dropped for the destination affected by the conflict. So if a new advertisement is received after a while leading to a conflict, traffic will be blackholed for the affected destination which is IMO not a good option. - §3.4 : [SLI] What happens if BGP sends the same SID ? -§3.5 : [SLI] Maybe we need to remind that we already have a tiebreaking rule between MS entries and Prefix SIDs : "For a given prefix, if both a MS entry with its Prefix-SID Sub-TLV and a Prefix TLV (e.g.: TLV135) with its Prefix-SID are received, the Prefix-SID advertised within the Prefix TLV MUST be preferred while the MS entry MUST be ignored." [SLI] Giving a preference to MS entries may be a very good idea from an operational point of view, especially if someone wants to migrate mappings from one range to another for example. This discussion is out of scope but we may need to think about it. - there are multiple small typo issues in SRGB examples : e.g. Range 3: (500, 5990 <=> Range 3: (500, 599) ; Range 1: (100, 199] <=> Range 1: (100, 199) -----Original Message----- From: spring [mailto:[email protected]] On Behalf Of [email protected] Sent: Thursday, December 17, 2015 18:27 To: [email protected]; Les Ginsberg (ginsberg) Subject: [spring] draft-ginsberg-spring-conflict-resolution Hi Les, authors, WG As an individual contributor, please find below some more detailed comments and considerations on the draft. Following the "please send text" request expressed during the meeting, please find enclosed some proposed text. (xml, txt, diff versus public version). I wished I had sent this before, but writing the text took longer than expected. BTW I still not happy with my text, but hopefully current text (or at least the table of content) should be enough to give an idea on the direction I have in mind. Thanks, Regards, Bruno __ As previous expressed on the mailing list and during the meeting, I'm especially concerned with Mapping Server advertisement where a single typo/bug can conflict with many/all SIDs in the network. In this case, I don't think that dropping all traffic to those SIDs is a desirable option. One option is to prefer individual advertisement (prefix SID) over general ones (Mapping Server). i.e. more specific wins. Note that this is the approach currently taken by the IS-IS draft: " For a given prefix, if both a MS entry with its Prefix-SID Sub-TLV and a Prefix TLV (e.g.: TLV135) with its Prefix-SID are received, the Prefix-SID advertised within the Prefix TLV MUST be preferred while the MS entry MUST be ignored." We can probably assume that this is already implemented (by compliant implementations), so I'm not sure to see the value of changing existing implementations if this is to get a more disruptive result for the network. __ Regarding SID conflict, the current draft proposes to drop all conflicting information. Looking at the big picture, this means: the more (Mapping Server) redundancy, the more risk of conflict, the less availability. This is probably not the property that we are looking for. __ I support the comment to consider the error handling work "recently" done in the IDR WG. IMHO, WG and authors did a good work on such a difficult subject (just like for SID conflicts, at the beginning opinions were diverse, and everyone had good reasons). I'm not sure how much reading the end result (RFC 7606) helps in understanding all the trade-off considered during the work. Reading the operational requirements, given that the IGP infrastructure is probably even more important than BGP for network operators/clients/traffic, it may be worthwhile to read BGP error handling operation requirements ( https://tools.ietf.org/html/draft-ietf-grow-ops-reqs-for-bgp-error-handling-07 ) Discussion with the people involved may help. FYI, as for me, the point I learnt are: - when we detect an error on the receiving side, there is a bug. In general, it's difficult to know whether the bug is on the sending side or the receiving side. Clearly, for the receiving side, the first reaction is to blame the sending side. For each error, it's useful to consider both options (i.e. error may be on my/receiving side). And even if the error is on the sending side, the receiver may run the same implementation (so "sender is too buggy to live" may apply to your own implementation i.e. the receiving side). - when we detect an error, it's useful to consider all possible causes and consequences before deciding to make things worse (e.g. killing a session/transit node/prefix). In particular, there may not be a single error but many (SIDs, source routers (especially if the error is on the receiving side)). So it's useful to consider that the decision may be multiplied 10 or 100 times. Clearly there are difference between IGP & BGP: usually more redundancy in BGP (more signaling path (redundant RR) and more AS exit points), more prefixes hence the cost of dropping one prefix is relatively less important, two-way point to point signaling channel, messages are flooded unchanged so people are less likely to shoot the messenger and errors are less likely to be multiplied during propagation.... __ "An alternative is to ensure at the nodes which originate these advertisements that no such overlap is allowed to be configured. Such overlaps can then be considered as a conflict if they are received. This allows a simpler and more efficient implementation of the database. This is the approach assumed in this document." I encourage implementation to check the configurations before accepting it. And to check its routing advertisement before sending it. I would assume that state of the art implementation will do it. Yet, I don't think it should be considered as an excuse to avoid error handling at the protocol level. BTW, I'm not sure how much you expect implementation checks to be done before committing/sending. e.g. If an operator configure a range on the SRMS, and this ranges conflict with existing SID advertised by other nodes, would you expect the implementation to detect such conflict and reject the configuration? More generally, if we assume some extensive checks by implementations before committing/sending, and that given such assumption we define a light error handling, I'd like such assumptions be documented in the document and probably be made normative (since the receiving side, is expected a specific behavior from the sending side, this is normative). Also "implementation of the database" is implementation dependent and is probably just one aspect of the implementation simplicity/efficiency. And speaking for myself, simplicity/efficiency/availability of 100s networks running segment routing significantly out weight the "simplicity and efficiency" of one to ten implementations. __ "The occurrence of conflicts is easily diagnosed from the behavior of the network as the forwarding of traffic which would, in the absence of conflicts, utilize segments no longer does so." I don't think that we need to kill customer's traffic to raise awareness of the network operator. And I don't support the idea that the more traffic you kill, the easier and faster the error is resolved. If you need to raise an error, please do so! e.g. in logs, syslogs, SNMP traps, netconf event notification... Dropping the traffic is not providing more information, it is increasing the pressure and hence the probability of quick erroneous actions. Not to mention that as all services will run over packet networks, and as there is a pressure to reduce costs we may have a single converged network for all services. In this case, there is a chance that if the network is down, some people/tools/equipment may not be reached anymore/easily. e.g. calling people on their cell phone does not work when the packet network is down. __ " The downside of ignoring conflicting entries is that forwarding of all packets with destinations covered by the conflicting entries will always be negatively impacted." Let's call a spade a spade. (btw, I prefer the French version :s/spade/cat which nowadays should be more popular on the Internet ;-) ) The traffic is not "negatively impacted", the traffic is "dropped". And for all services/customers/BGP routes using theses SIDs. __ 3.2.2. Preference Algorithm [...] "This approach requires that conflicting entries first be identified" All approach requires conflicting entries to be identified. This is not a drawback of the "preference algorithm" "Based on which entry is preferred this in turn may impact what other entries are considered in conflict" This is not additional rule/work. This is applying the same rule/algo to subsequent entries. Also, from the network perspective, in the end, this will not drop more entries than the "ignore rule", quite the contrary. "Based on which entry is preferred this in turn may impact what other entries are considered in conflict i.e. if A conflicts with B and B conflicts with C - it is possible that A does NOT conflict with C. Hence if as a result of the evaluation of the conflict between A and B, entry B is not used the conflict between B and C will not be considered." I'm not sure to get what is implied. As A and B conflicts and B and C conflicts, with "ignore conflicting entries", I would assume that all A, B, C be ignored, no? Or do you plan to ignore conflicting entries until there is no conflict? In which case we would only remove one of the two conflicting entries and end up with a "preference" resolution 'and never drop both conflicting entries since once we drop the first, there is no more conflict) ___ _________________________________________________________________________________________________________________________ Ce message et ses pieces jointes peuvent contenir des informations confidentielles ou privilegiees et ne doivent donc pas etre diffuses, exploites ou copies sans autorisation. Si vous avez recu ce message par erreur, veuillez le signaler a l'expediteur et le detruire ainsi que les pieces jointes. Les messages electroniques etant susceptibles d'alteration, Orange decline toute responsabilite si ce message a ete altere, deforme ou falsifie. Merci. This message and its attachments may contain confidential or privileged information that may be protected by law; they should not be distributed, used or copied without authorisation. If you have received this email in error, please notify the sender and delete this message and its attachments. As emails may be altered, Orange is not liable for messages that have been modified, changed or falsified. Thank you. _________________________________________________________________________________________________________________________ Ce message et ses pieces jointes peuvent contenir des informations confidentielles ou privilegiees et ne doivent donc pas etre diffuses, exploites ou copies sans autorisation. Si vous avez recu ce message par erreur, veuillez le signaler a l'expediteur et le detruire ainsi que les pieces jointes. Les messages electroniques etant susceptibles d'alteration, Orange decline toute responsabilite si ce message a ete altere, deforme ou falsifie. Merci. This message and its attachments may contain confidential or privileged information that may be protected by law; they should not be distributed, used or copied without authorisation. If you have received this email in error, please notify the sender and delete this message and its attachments. As emails may be altered, Orange is not liable for messages that have been modified, changed or falsified. Thank you. _______________________________________________ spring mailing list [email protected] https://www.ietf.org/mailman/listinfo/spring
