Re: [netmod] AD review: draft-ietf-netmod-revised-datastores-08

Robert Wilton Thu, 21 Dec 2017 06:54:25 -0800


On 21/12/2017 13:03, Vladimir Vassilev wrote:

On 12/21/2017 11:34 AM, Robert Wilton wrote:
Hi Vladimir,
First point of clarification is that this is not aboutrunning/intended at all. The contents of running/intended do notchange in anyway depending on whether hardware is present or absent.
The section is only concerned with how the configuration is appliedin operational, and basically says that you cannot applyconfiguration for resources that are missing (which seemsreasonable). E.g. I cannot configure an IP address on a physicalinterface that isn't there. Or if the physical interface getsremoved then the configuration associated with that interface is alsoremoved from operational.
Operational isn't validated and data model constraints are allowed tobe broken (ideally transiently).
I want to focus on this. IMO giving up schema validitiy for anydatastore is unacceptable price. Pre-NMDA devices had full modelsupport in operational data (all YANG constrains part of the modelwithout discrimination were enforced). If this is about to change itwill compromise interoperability and a significant portion of theclient implementation workload that can be automated will need to becoded in hand and tested.

I don't agree with this. A client can easily see if configuration hasbeen applied by looking at the corresponding data node in theoperational datastore. If <operational> is fully implemented then thisapplies to any data node in the configuration.

Unresolved leafrefs, undefined behaviour of different implementationsremoving different configuration nodes in violation of YANG semanticconstraints (which I do not think can be so clearly separated from thesyntactic constraints when one considers types like leafref,instance-identifier etc.) and the corresponding side effects based onthe server implementators own creativity is eventually going to createmore problems.

I believe that returning the truth is more important that returningfalse information (or no information at all) to satisfy constraints in aschema model.

1. IMO the only acceptable solution is to have YANG valid operationaldatastore at all times. operational like any other datastore MUST bevalid YANG data tree and it has to be a system implementation task toconsider all complications resulting from the removal of the resourcesleading to any data transformations. If this is difficult orimpossible other mechanisms to flag missing resources should be used(e.g. /interfaces/interface/oper-status=not-present) This sounds likea useful contract providing the value of a standard the alternativedoes not.

I think that forcing operational to always be consistent quickly fallsdown as a solution: - In the case of a device with multiple linecards you cannot forcethat they are always lockstep in sync with each other. - In a case of device with multiple concurrent daemon processes, youcannot force that they always all have a consistent view of theoperational state. - When configuration is changing the system may go through transientstates where the operational state is invalid. - The system might run of memory, or daemons crash, or fail, or becomecorrupted, all leaving the system in an invalid operational state.- You can't stop someone from physical removing an piece of hardware,which could immediately make the applied configuration and operationalstate inconsistent (or inaccurate).

If you don't allow invalid states to be reported then you merely preventthe client from querying any operational state from the device whilst itis an inconsistent state. I.e. the making the device harder to manage,and less maintainable.

The "other mechanism" that you describe above sounds like it requires anadhoc solution to this problem for every schema node. The <operational>datastore solves this problem in a generic way. A client can always seewhat configuration is currently applied in the system.

2. Even with the change in 1. I do not see the removal of intendedconfiguration nodes from operational as a solution worth implementingon our servers. I do not see a real world plug-and-play scenario thatcan be automatically solved without specific additions to the modelse.g. /interfaces/interface/oper-status=not-present is oversimplifiedsolution but it needs to be extended exactly as much as the solutionprovided by the removal of config true; nodes without the sacrifice ofYANG validity of operational.

There is somewhat of an implementation choice in this.

The server is obliged to return what is "in use" in operational, but itis up to the server vendor to decide what "in use" actually means for aparticular item of configuration (e.g. for a configured value that mightbe distributed to multiple internal daemons).

3. Solutions like /interfaces/interface/admin-state stop working. Withthe interface removed you can no longer figure if the if-mib has ordoes not have the interface enabled so an operator has to use SNMP orwait for a replacement line card to be connected to figure this bit ofinformation. My interpretation of the MAY as requirement level in sec.5.3. The Operational State Datastore (<operational>) is thatplug-and-play solutions can be implemented without this limitedapproach that has the same problem as the pre-NMDA only now we have tohave /interfaces-state to keep config false; data relevant to hardwarethat is configured but not present:

Admin-status is the intended configuration, clients can always query<running> or <intended> to see the desired state of the system. Theycan also query <operational> and see that the interface doesn'tcurrently exist in the system. The rest of the properties associatedwith the interface sort of seem a bit irrelevant at that point.

/interfaces-state is deprecated and going away. Its direct equivalentis /interfaces in <operational>. I'm not sure that the semanticsbetween the two has necessarily changed very much, except that the NDMAarchitecture now describes a formal mechanism for hardware removal,whereas previously it was left entirely to the vendors to each do theirown thing.

For some of our systems, the applied configuration for a physicalinterface is managed on the linecard hosting that interface. If thatlinecard is pulled out then all of that applied configuration really hasgone. In other cases, a user might remove the optics module for aninterface, in which case the interface still exists but the interface isreported as being operationally down.

   configuration data nodes supported in a configuration datastore
   MAY be omitted from <operational> if a server is not able to
   accurately report them.
I realize this discussion comes late. I have stated my objections tothis particular part of the NMDA draft earlier.

Yes, this discussion does seem very late.

Did you raise your comments during either of the WG LCs? I thought thatI had been pretty diligent tracking and replying to all issues that hadbeen raised during the WG LC.


Thanks,
Rob

Vladimir
But I agree that there could be configuration that is referencingthose missing resources, and depending on implementation then thatconfiguration may need to become not applied as well. Or perhaps thefailure is reported in a different way (e.g. IGP neighbor is down).
I also agree that this is non trivial, but the systems that I amfamiliar with have always had to deal with this issue. At the datamodel level I don't think that this is any more complex than theexisting 'when' statement processing that has exactly the same issuesif a "when" statement becomes invalid during a config change andrequires the associated configuration to be deleted (which again canrecursively require configuration to be removed).
Alternative solutions are:
- mandate that nobody physically removes a linecard if there isstill configuration referencing it, but it is hard to enforce this insoftware :-) - freeze the config from any further changes if a linecard isremoved that makes the config invalid, but this doesn't seem like arobust solution ...
I think that the existing solution is the best approach.

A couple of further comments inline below as well ...

On 20/12/2017 21:44, Vladimir Vassilev wrote:
Hello,

On 12/20/2017 05:40 PM, Benoit Claise wrote:
Dear all,
In order not to be the bottleneck in the process and assuming thatthe document will be in "publication requested" pretty soon, hereis my AD review of draft-ietf-netmod-revised-datastores-08
-


        5.3.2. Missing Resources

    Configuration in <intended> can refer to resources that are not
available or otherwise not physically present. In thesesituations,
    these parts of <intended> are not applied.  The data appears in
    <intended> but does not appear in <operational>.
I have some concerns with this section.
Systems implementing this are expected to remove config true; nodeswhile figuring the necessary changes to ensure the remaining set ofconfig true; nodes in operational validates against the operationaldatastore model. The implementation of this is not a trivial task atall. In order to remove configuration nodes considered inactive onthe fly one needs to remove all references to those nodes inmandatory leafrefs in the best case and a potentially long andcomplex dependency chain of YANG constrain-statements (Xpath etc.)have to be resolved in a worse case. It is difficult to automatethis. It requires significant effort to track and remove/fix allthose dependencies just to come up with valid configuration thatrepresents the configuration without the "inactive" nodes which inmany usecases is completely unjustified implementation effort.
In addition in many cases it is not desirable to remove config true;nodes that depended on a removed resource. For example:
1. A configuration instance of a filter with mandatory interface-refingress and egress ports has to be removed from the operationaldatastore if the egress port is removed as a physical resource. Thisin effect removes the config false; statistics that might be stillof interest counting the matched traffic while the filter does nothave physical egress port to send the packets.
This isn't necessarily true. The architecture does not require thatthe filter object is removed because operational is allowed toviolate the constraints. Ultimately I think that the behaviour herewill depend on implementation.
2. Alarm that is configured with mandatory reference to the missingresource containing a counter of the elapsed time since the resourcewent missing etc.
Again, the draft does not require that the alarm becomes notapplied. This also depends on the implementation.
Thanks,
Rob
I do not find any text in the draft addressing the concerns above. Ido not propose a change yet but I hope to hear what others thinkabout that.
Vladimir
I understand what you want to say.
Let me take an example. I have a router with a Line Card configuredand working well. if I remove the LC, the configuration shouldstill be in the <running> and <intended> but not in <operational>.However, based on figure below, the notion of "inactive" nodesmight be misleading. Indeed, people might read that the LC isinactive, so the LC configuration should not be in <intended>
      +-------------+                 +-----------+
      | <candidate> |                 | <startup> |
      |  (ct, rw)   |<---+       +--->| (ct, rw)  |
      +-------------+    |       |    +-----------+
             |           |       |           |
             |         +-----------+         |
             +-------->| <running> |<--------+
                       | (ct, rw)  |
                       +-----------+
                             |
| // configurationtransformations,
                             |        // e.g., removal of "inactive"
                             |        // nodes, expansion of templates
                             v
                       +------------+
                       | <intended> | // subject to validation
                       | (ct, ro)   |
                       +------------+
I understand that "inactive nodes" has a different meaning.

Proposal:
OLD: removal of "inactive" nodes
NEW: removal of the nodes marked as "inactive"

- In the C.1 example,
    <system
        xmlns="urn:example:system"
xmlns:or="urn:ietf:params:xml:ns:yang:ietf-origin">

      <hostname or:origin="or:dynamic">bar</hostname>

      <interface or:origin="or:intended">
        <name>eth0</name>
        <auto-negotiation>
          <enabled or:origin="or:default">true</enabled>
          <speed>1000</speed>
        </auto-negotiation>
        <speed>100</speed>
        <address>
          <ip>2001:db8::10</ip>
          <prefix-length>64</prefix-length>
        </address>
        <address or:origin="or:dynamic">
          <ip>2001:db8::1:100</ip>
          <prefix-length>64</prefix-length>
        </address>
      </interface>
I guess it "or:dynamic" should be replaced by "or:learned"

Justification:

      identity learned {
        base origin;
        description
"Denotes configuration learned from protocol interactionswith
           other devices, instead of via either the intended
           configuration datastore or any dynamic configuration
           datastore.

           Examples of protocols that provide learned configuration
include link-layer negotiations, routing protocols,_andDHCP._";
_Editorial:_

- number the figures

- section 8.2
    This document registers two YANG modules in the YANG Module Names
registry [RFC6020 <https://tools.ietf.org/html/rfc6020>]. Following the format in [RFC6020<https://tools.ietf.org/html/rfc6020>], the the
    following registrations are requested:

duplicated "the the"
  Regards, Benoit (OPS AD)


_______________________________________________
netmod mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/netmod
_______________________________________________
netmod mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/netmod
.


_______________________________________________
netmod mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/netmod

Re: [netmod] AD review: draft-ietf-netmod-revised-datastores-08

Reply via email to