[ https://issues.apache.org/jira/browse/MESOS-2080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joseph Wu updated MESOS-2080:
-----------------------------
Labels: twitter (was: mesosphere twitter)
> Add master metrics for maintenance.
> -----------------------------------
>
> Key: MESOS-2080
> URL: https://issues.apache.org/jira/browse/MESOS-2080
> Project: Mesos
> Issue Type: Task
> Components: master
> Reporter: Benjamin Mahler
> Assignee: Yong Qiao Wang
> Labels: twitter
>
> We need metrics to gain visibility into the maintenance functionality.
> These metrics will also let operators add alerting. In particular, we
> want:
> # Number of scheduled hosts.
> # Number of active windows.
> # Number of expired windows.
> # Number of successful drains.
> # Number of failed drains.
> As an example of an alert guideline, the number of expired windows should
> be exposed as a gauge so we can ensure that it is not growing excessively.
> An alert on this gauge catches cases where operators do not unschedule
> maintenance once it is complete.
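
The five metrics and the expired-window alert above can be sketched roughly as follows. This is only an illustration in Python, not the Mesos master implementation: the `Window` and `MaintenanceMetrics` types, the `maintenance/...` metric names, and the `expired_windows_alert` helper are all hypothetical. It also assumes a window is "active" while the current time falls inside it and "expired" once its end time has passed without it being unscheduled.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Window:
    """Hypothetical maintenance window: start time, duration, and hosts."""
    start: float      # seconds since epoch
    duration: float   # seconds
    hosts: List[str]

    def is_active(self, now: float) -> bool:
        # Assumption: active means "now" falls inside the window.
        return self.start <= now < self.start + self.duration

    def is_expired(self, now: float) -> bool:
        # Assumption: expired means the window ended but is still scheduled.
        return now >= self.start + self.duration


@dataclass
class MaintenanceMetrics:
    """Holds the schedule plus drain counters (all names are illustrative)."""
    windows: List[Window] = field(default_factory=list)
    successful_drains: int = 0  # counter, only ever incremented
    failed_drains: int = 0      # counter, only ever incremented

    def snapshot(self, now: float) -> Dict[str, int]:
        # Gauges are recomputed from the current schedule on every read.
        scheduled_hosts = {h for w in self.windows for h in w.hosts}
        return {
            "maintenance/scheduled_hosts": len(scheduled_hosts),
            "maintenance/active_windows":
                sum(1 for w in self.windows if w.is_active(now)),
            "maintenance/expired_windows":
                sum(1 for w in self.windows if w.is_expired(now)),
            "maintenance/successful_drains": self.successful_drains,
            "maintenance/failed_drains": self.failed_drains,
        }


def expired_windows_alert(snapshot: Dict[str, int], threshold: int) -> bool:
    """Fire when more expired windows linger than the operator tolerates."""
    return snapshot["maintenance/expired_windows"] > threshold
```

In this sketch the first three metrics are gauges derived from the schedule, while the drain metrics are counters. An operator would pick the alert threshold based on how long completed maintenance is allowed to stay scheduled.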