Chun-Hung Hsiao created MESOS-9639:
--------------------------------------
Summary: Make CSI plugin RPC metrics agnostic to CSI versions.
Key: MESOS-9639
URL: https://issues.apache.org/jira/browse/MESOS-9639
Project: Mesos
Issue Type: Task
Components: storage
Reporter: Chun-Hung Hsiao
Assignee: Chun-Hung Hsiao
Currently SLRP provides per-CSI-call metrics, e.g.:
{noformat}
resource_providers/<rp_type>.<rp_name>/csi_plugin/rpcs/csi.v0.controller.CreateVolume/successes
resource_providers/<rp_type>.<rp_name>/csi_plugin/rpcs/csi.v0.node.NodeGetId/errors
{noformat}
If we are to continue to provide such fine-grained metrics, when operators
upgrade their CSI plugins to CSI v1, then SLRP would report another set of
metrics for v1, which would be inconvenient to operators.
Also the fine-grained metrics are not very useful for operators, as most
information are highly correlated to per-operation metrics. So most likely
operators would simply aggregate the per-CSI-call metrics for monitoring CSI
plugins, and use per-operation metrics to monitor volume creation/destroy/etc.
So instead of provide such fine-grained metrics, we could just provide a set of
aggregated rpc metrics that are agnostic to CSI versions, such as:
{noformat}
resource_providers/<rp_type>.<rp_name>/csi_plugin/rpcs_pending
resource_providers/<rp_type>.<rp_name>/csi_plugin/rpcs_finished
resource_providers/<rp_type>.<rp_name>/csi_plugin/rpcs_failed
resource_providers/<rp_type>.<rp_name>/csi_plugin/rpcs_cancelled
{noformat}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)