[OPSAWG] OPS-DIR review of draft-ietf-opsawg-vmm-mib-02

Romascanu, Dan (Dan) Thu, 14 May 2015 09:46:35 -0700

I have reviewed this document as part of the Operational directorate's
ongoing effort to review all IETF documents being processed by the IESG.  These
comments were written with the intent of improving the operational aspects of 
the
IETF drafts. Comments that are not addressed in last call may be included in AD 
reviews
during the IESG review.  Document editors and WG chairs should treat these 
comments
just like any other last call comments.



Document Status: Ready with Issues

This is a MIB document - however my review focuses on the perspective of an 
operational and manageability review. I assume that a separate RFC 4181 review 
is or will be performed.

5706 check list:


A.1.  Operational Considerations



   1.  Has deployment been discussed?  See Section 2.1.



       *  Does the document include a description of how this protocol

          or technology is going to be deployed and managed?



       *  Is the proposed specification deployable?  If not, how could

          it be improved?



       *  Does the solution scale well from the operational and

          management perspective?  Does the proposed approach have any

          scaling issues that could affect usability for large-scale

          operation?



       *  Are there any coexistence issues?



DR: there is some discussion about deployment, especially in relation to the 
read-only vs. write capabilities.



   2.  Has installation and initial setup been discussed?  See

       Section 2.2.



       *  Is the solution sufficiently configurable?



       *  Are configuration parameters clearly identified?



       *  Are configuration parameters normalized?



       *  Does each configuration parameter have a reasonable default

          value?



       *  Will configuration be pushed to a device by a configuration

          manager, or pulled by a device from a configuration server?



       *  How will the devices and managers find and authenticate each

          other?



DR: no, but I do not believe that there are any special or different problems 
than the ones encountered when installing any new SNMP agent.



   3.  Has the migration path been discussed?  See Section 2.3.



       *  Are there any backward compatibility issues?



DR: N/A - this is the first version of this MIB module. Relationship with other 
MIB modules is discussed.





   4.  Have the Requirements on other protocols and functional

       components been discussed?  See Section 2.4.

     *  What protocol operations are expected to be performed relative

          to the new protocol or technology, and what protocols and data

          models are expected to be in place or recommended to ensure

          for interoperable management?



DR: N/A



   5.  Has the impact on network operation been discussed?  See

       Section 2.5.



       *  Will the new protocol significantly increase traffic load on

          existing networks?



       *  Will the proposed management for the new protocol

          significantly increase traffic load on existing networks?



       *  How will the new protocol impact the behavior of other

          protocols in the network?  Will it impact performance (e.g.,

          jitter) of certain types of applications running in the same

          network?



       *  Does the new protocol need supporting services (e.g., DNS or

          Authentication, Authorization, and Accounting - AAA) added to

          an existing network?



DR: the number of notifications issued by the agent may be a problem and the 
tuning is quite course. See the more detailed comments.



   6.  Have suggestions for verifying correct operation been discussed?

       See Section 2.6.



       *  How can one test end-to-end connectivity and throughput?



       *  Which metrics are of interest?



       *  Will testing have an impact on the protocol or the network?



DR: N/A



   7.  Has management interoperability been discussed?  See Section 3.1.



       *  Is a standard protocol needed for interoperable management?



       *  Is a standard information or data model needed to make

          properties comparable across devices from different vendors?



DR: this is a standard data model, relationship with other MIB modules is 
discussed



   8.  Are there fault or threshold conditions that should be reported?

       See Section 3.3.



       *  Does specific management information have time utility?



       *  Should the information be reported by notifications?  Polling?

          Event-driven polling?



       *  Is notification throttling discussed?

*  Is there support for saving state that could be used for root

          cause analysis?



DR: standard MIB implementation conditions apply (like system time for time 
object). As in any SNMP agent implementations the operators strategy may be 
notifications-triggered polling, but this is not explicitly mentioned. 
Notification throttling is not applied, and it may be useful - see comment #18





   9.  Is configuration discussed?  See Section 3.4.



       *  Are configuration defaults and default modes of operation

          considered?



       *  Is there discussion of what information should be preserved

          across reboots of the device or the management system?  Can

          devices realistically preserve this information through hard

          reboots where physical configuration might change (e.g., cards

          might be swapped while a chassis is powered down)?





DR: This is a mostly read-only MIB module. Two objects are defined to 
enable/disable notifications, the compliance section does not make them 
mandatory



A.2.  Management Considerations



   Do you anticipate any manageability issues with the specification?



   1.  Is management interoperability discussed?  See Section 3.1.



       *  Will it use centralized or distributed management?



       *  Will it require remote and/or local management applications?



       *  Are textual or graphical user interfaces required?



       *  Is textual or binary format for management information

          preferred?



DR: remote management is possible, this MIB module makes possible interoperable 
management





   2.  Is management information discussed?  See Section 3.2.



       *  What is the minimal set of management (configuration, faults,

          performance monitoring) objects that need to be instrumented

          in order to manage the new protocol?



DR: yes, this is a MIB specification



   3.  Is fault management discussed?  See Section 3.3.



       *  Is Liveness Detection and Monitoring discussed?



       *  Does the solution have failure modes that are difficult to

          diagnose or correct?  Are faults and alarms reported and

          logged?



DR: Yes



   4.  Is configuration management discussed?  See Section 3.4.



       *  Is protocol state information exposed to the user?  How?  Are

          significant state transitions logged?



DR: Yes, configuration management is discussed, but only read operations are 
possible with the exception of enabling and disabling notifications.





5.  Is accounting management discussed?  See Section 3.5.



DR: N/A





   6.  Is performance management discussed?  See Section 3.6.



       *  Does the protocol have an impact on network traffic and

          network devices?  Can performance be measured?



       *  Is protocol performance information exposed to the user?



DR: Yes



   7.  Is security management discussed?  See Section 3.7.



       *  Does the specification discuss how to manage aspects of

          security, such as access controls, managing key distribution,

          etc.
DR: Yes, see detailed comments

A few more detailed comments:


1.       Page 5: s/the MIB objects are managed at a hypervisor/ the MIB objects 
are managed at the hypervisor/

2.       Page 6: s/the relationship between other MIB modules/ s/the 
relationship with other MIB modules/

3.       Page 8: figure 2 - at the top did you mean 'Notation' rather than 
'Notification'?

4.       Same - in general this figure does not really help. It is not clear 
how are the Blocked and Crashed states linked to the rest. I would at least add 
the numbers of the states in the VirtualMachineOperState TC

5.       Page 9: The statement about the MIB module providing writeable objects 
that can be used for non-persistent changes like changing memory allocation or 
CPU allocation seems to be a leftover from previous versions - in any case it 
is not true any longer for this version of the MIB module.

6.       I frankly do not know if the OID tree structure that spreads in pages 
9 to 11 is useful at all. Who needs it? And if anybody needs this, it can be 
obtained from any MIB tool. I believe that a short list of the MIB groups and 
tables and a few words about what each of these does is sufficient.

7.       Page 12: I find the second paragraph in section 5 confusing - maybe 
it's a language issue. The last sentence for example does not makes sense (what 
does 'the objects of this information' mean?)

8.       Same about the 4th paragraph in the same section and page. What does 
the first sentence mean? ('the objects ... are not also included')

9.       Page 15:  the explanation of the destroy(5) value in the 
VirtualMachineAdminState TC may confuse. We do not allow changing VM admin 
states by write operations. Of course, an agent implementing this MIB module 
may read this state and reflect it in an object that uses this SYNTAX. I 
believe that some text about this usage of the TC needs to be added.

10.   Page 20: is there any reason that the TC VirtualMachineStorageAccess has 
no 'unknown' enumeration defined?

11.   Page 20-21: Why is the VirtualMachineStorageType limited to two media 
types only? Would not an IANA-administered TC be more appropriate here?

12.   Page 24: It is not clear to me what is an 'administered region' mentioned 
in the DESCRIPTION clause of the vmUUID object

13.   Page 25: why 'power' in the DESCRIPTION of the vmAdminState?

14.   Page 26: it looks like the intent of the vmMemUnit is to use bytes as 
unit of expressing memory size - this should be explicitly mentioned

15.   Page 33: same comment relative to vmStorageSizeUnit

16.   Page 34-35: I have a big question mark about how the vmStorageReadLatency 
and vmStorageWriteLatency objects need to be used. First the usage of the 
Counter64 syntax seems odd as these objects do not count anything and their 
increase is not unitary. Second I do not know how to interpret them. What does 
say a reading of 'a million' or of 'a billion' mean? It really makes no sense 
if not interpreted in conjunction with the number of respective I/O operations, 
media type, etc. I guess that the implementation of these objects is not 
trivial and probably requires HW support taking into consideration the 
microseconds resolution. At a minimum I guess that some text recommending to 
the operators how they are supposed to be used is needed.

17.   I am concerned that the current way of defining the notifications 
switches is too course. Hypervisors may have many VMs in charge, and if each 
generates one notification per each state changes the numbers can become big 
even in normal operation. Maybe some throttling mechanism would be useful. Or 
maybe a couple of more switches that allow to enable only 'critical' 
notifications (e.g. vmCrashed).

18.   The Security Considerations section does not include a description of the 
security hazards of mis-configuration of the writeable objects (danger of 
flooding the network with unwanted notifications)

Thanks and Regards,

Dan

_______________________________________________
OPSAWG mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/opsawg

[OPSAWG] OPS-DIR review of draft-ietf-opsawg-vmm-mib-02

Reply via email to