Re: [Technical Errata Reported] RFC5880 (7240)

Dave Katz Wed, 22 Feb 2023 11:34:54 -0800

Thanks for the background.

I guess the fact of the matter is that, since the issue cannot affect 
interoperability, it’s hard to imagine getting the WG to go through a bunch of 
machinations to go out of its way to fix something that is entirely pedantic.  
In that case I think holding the erratum for update is the right choice.  The 
erratum can describe the ambiguity and the WG can decide what to do about it if 
they find another reason to update the spec...


—Dave


> On Feb 22, 2023, at 11:29 AM, John Scudder <[email protected]> wrote:
> 
> Hi All,
> 
> Regarding Jeff’s "Given the maturity of the feature, I'd suggest sticking to 
> the reality on the ground”, I want to step in and remind folks about what our 
> guidelines are for processing errata [1]. They are fairly narrow, by design. 
> One of the high-order bits of the guidelines is, "Errata are meant to fix 
> "bugs" in the specification and should not be used to change what the 
> community meant when it approved the RFC.”  What I take this to mean in the 
> current context is, if the behavior specified in the RFC has been found to be 
> ‘wrong’ (for whatever definition of ‘wrong’ you choose to apply), an erratum 
> is definitively not the way to correct that. An erratum is to clarify or 
> correct whatever the intent of the RFC was at time of publication. Of course, 
> many RFCs (this one included, it seems) didn’t receive detailed scrutiny of 
> every crevice of the spec before being declared ready for publication, and so 
> it’s not always possible to really say that the consensus was firmly one way 
> or the other. In such cases, I think we have to err on the side of what the 
> words in the spec say.
> 
> About the furthest I’d go in documenting that (part of) the WG now thinks the 
> specified behavior is undesirable, is to note it in a ‘hold for document 
> update’ erratum. That seems reasonable in this case — it can lay out the pros 
> and cons and at least creates an artifact for future implementors to notice.
> 
> The bottom line is that changes to specified behavior require WG and IETF 
> consensus, and that means they require an RFC to update or obsolete the old 
> behavior. This is one of the pointy bits of our process, that RFCs document 
> the consensus at a moment in time, not the evolving consensus.
> 
> Thanks,
> 
> —John
> 
> [1] 
> https://www.ietf.org/about/groups/iesg/statements/processing-errata-ietf-stream/
> 
>> On Feb 22, 2023, at 11:14 AM, Jeffrey Haas <[email protected]> wrote:
>> 
>> 
>> Dave,
>> 
>> Just as a reminder, the context for why this errata is being discussed is 
>> this inquiry:
>> https://mailarchive.ietf.org/arch/msg/rtg-bfd/YIeCo-nQicI_OIcVncYaJM5Zz6c/
>> 
>> More below:
>> 
>> 
>>> On Feb 17, 2023, at 12:04 PM, Dave Katz 
>>> <[email protected]> wrote:
>>>> On Feb 17, 2023, at 8:47 AM, Reshad Rahman <[email protected]> wrote:
>>>> Having the diag field as breadcrumb has been extremely useful indeed. But 
>>>> both ends can store last diag field sent/received, we don't have to keep 
>>>> sending the diag field after the failure has cleared. It seems odd to be 
>>>> sending a diag field which happened e.g. a year ago.
>>> 
>>> That property helped me when debugging my implementation, as it survives 
>>> the restart/reboot of the far end.
>>> 
>>> There is also no timeout that would make sense;  “forever, for small values 
>>> of ‘forever'” is semantically consistent and the only thing that makes 
>>> sense (to me, at least).
>>> 
>>> Resetting it to zero only deletes information (albeit a tiny amount of it) 
>>> and doesn’t help anything;  you know that the session is up, so the 
>>> diagnostic for its most recent transition to non-upness is disambiguated.
>>> 
>>> Debugging broken things is a scramble for bits of data;  leaving a 
>>> breadcrumb is a polite gift.
>> 
>> From my perspective, the breadcrumb is useful to note during the 
>> transitions, and not simply the transitions for the state.  Examples have 
>> been given where the diag is updated as part of a state transition (governed 
>> by normative text in 5880), or transitions that may happen while the session 
>> remains up (e.g. concatenated path down, echo, etc.).  The RFC isn't great 
>> about saying how you clear such things when the state is still Up; 
>> intuitively it's to return to "No Diagnostic".
>> 
>> However, your own leaning, Dave, is "leave it set forever".  Using the above 
>> examples for diag signaling an event while leaving the state up, I don't 
>> think you mean that.
>> 
>> So, again, the interesting breadcrumbs are when Things Change.  Each of 
>> these items is an edge transition of note. If I care about the event, I care 
>> about it when it happens and will remember it.  I'm not going to look at 
>> diag to reflect this forever.
>> 
>>> 
>>>> 
>>>> Also the text in 6.8.1 says "The diagnostic code specifying the reason for 
>>>> the most recent change in the local session state.". To me that means 
>>>> resetting bfd.LocalDiag to 0 when the state changes to Up.
>>> 
>>> Thus the language that needs fixing (I know, I wrote it...)
>>> 
>>>> 
>>>> AFAIK IOS-XR and JunOS reset LocalDiag. It'd be good to hear from other 
>>>> implementations.
>>> 
>>> Probably.  I might have even coded it that way 20 years ago, or someone 
>>> else did later, thus underscoring the largely-irrelevant nature of this 
>>> discussion...
>> 
>> I did confirm with Juniper's BFD developers that it's reset to 0 when we 
>> transition to Up.  
>> 
>> Given the maturity of the feature, I'd suggest sticking to the reality on 
>> the ground.
>> 
>> -- Jeff
>> 
>

Re: [Technical Errata Reported] RFC5880 (7240)

Reply via email to