[ https://issues.apache.org/jira/browse/MESOS-8609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16399215#comment-16399215 ]
Zhitao Li edited comment on MESOS-8609 at 3/14/18 7:59 PM: ----------------------------------------------------------- commit 82c50c0e00284c131354499f74176b19d89bd21d (HEAD -> master, origin/master, origin/HEAD) Author: Zhitao Li <zhitaoli...@gmail.com> Date: Wed Mar 14 09:25:01 2018 -0700 Document new `slave/recovery_time_secs` gauge. Review: https://reviews.apache.org/r/66070 commit b8526c61403214aaa67fa941b4e8b0fd8e3328f2 Author: Zhitao Li <zhitaoli...@gmail.com> Date: Wed Mar 7 15:18:53 2018 -0800 Added a test to make sure `slave/recovery_time_secs` is reported. Review: https://reviews.apache.org/r/65959 commit 026dafd33cd23d41818e18e31ec271fa2c13abd2 Author: Zhitao Li <zhitaoli...@gmail.com> Date: Tue Mar 6 17:43:48 2018 -0800 Added a gauge for how long agent recovery takes. The new metric `slave/recover_time_secs` can be used to tell us how long Mesos agent needed to finish its recovery cycle. This is an important metric on agent machines which have a lot of completed executor sandboxes. Note that the metric 1) will only be available after recovery succeeded and 2) never change its value across agent process lifecycle afterwards. Review: https://reviews.apache.org/r/65954 was (Author: zhitao): commit 82c50c0e00284c131354499f74176b19d89bd21d (HEAD -> master, origin/master, origin/HEAD) Author: Zhitao Li <zhitaoli...@gmail.com> Date: Wed Mar 14 09:25:01 2018 -0700 Document new `slave/recovery_time_secs` gauge. Review: https://reviews.apache.org/r/66070 commit b8526c61403214aaa67fa941b4e8b0fd8e3328f2 Author: Zhitao Li <zhitaoli...@gmail.com> Date: Wed Mar 7 15:18:53 2018 -0800 Added a test to make sure `slave/recovery_time_secs` is reported. Review: https://reviews.apache.org/r/65959 commit 026dafd33cd23d41818e18e31ec271fa2c13abd2 Author: Zhitao Li <zhitaoli...@gmail.com> Date: Tue Mar 6 17:43:48 2018 -0800 Added a gauge for how long agent recovery takes. The new metric `slave/recover_time_secs` can be used to tell us how long Mesos agent needed to finish its recovery cycle. This is an important metric on agent machines which have a lot of completed executor sandboxes. Note that the metric 1) will only be available after recovery succeeded and 2) never change its value across agent process lifecycle afterwards. Review: https://reviews.apache.org/r/65954 > Create a metric to indicate how long agent takes to recover executors > --------------------------------------------------------------------- > > Key: MESOS-8609 > URL: https://issues.apache.org/jira/browse/MESOS-8609 > Project: Mesos > Issue Type: Improvement > Components: agent > Reporter: Zhitao Li > Assignee: Zhitao Li > Priority: Minor > Labels: Metrics, agent > -- This message was sent by Atlassian JIRA (v7.6.3#76005)