[
https://issues.apache.org/jira/browse/MESOS-7819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16127362#comment-16127362
]
Benno Evers commented on MESOS-7819:
------------------------------------
For the metrics that might occasionally be useful for debugging but that we
worry would expose too much internal state (points 1, 2, 5), maybe another idea
would be to introduce something like private metrics. A private metric would
essentially be a {{volatile static int64_t}}: all modifications are preserved
even at high optimization levels, but the only way to actually see the value
would be through a debugger.
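
To make the private-metrics idea concrete, here is a minimal sketch of what such counters could look like. The namespace, the counter names and the helper functions are all hypothetical and are not existing libprocess identifiers. Note that {{volatile}} only keeps the reads and writes from being optimized away; it does not make updates atomic, so this sketch assumes each counter is touched from a single thread or that slightly racy values are acceptable for debugging.

```cpp
#include <cstdint>

// Hypothetical namespace; nothing here is an existing libprocess name.
namespace private_metrics {

// `volatile` forces every read and write to be emitted, so the values remain
// inspectable from a debugger even at high optimization levels.
// It does NOT provide atomicity or any ordering between threads.
volatile static std::int64_t pending_messages = 0;
volatile static std::int64_t active_sockets = 0;

inline void increment(volatile std::int64_t& counter)
{
  // Explicit read-modify-write; `++` on a volatile object is deprecated
  // in C++20, so it is spelled out here.
  counter = counter + 1;
}

inline void decrement(volatile std::int64_t& counter)
{
  counter = counter - 1;
}

} // namespace private_metrics
```

A call such as `private_metrics::increment(private_metrics::pending_messages)` would go wherever a message is enqueued, with the matching `decrement` on dequeue. In a debugger such as gdb, something like `print private_metrics::pending_messages` should then show the current value.
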
Some thoughts about the individual proposed metrics: it seems to me that any
single one wouldn't be very useful, because it's hard to say in isolation how
many actors/connections/messages are "normal" for the different parts of Mesos.
With several of them, however, it would become possible to compare their ratios
to known "normal" ranges and maybe pinpoint the fault location more precisely.
In particular, the average number of pending messages might be useful not only
for debugging but also for performance regression tests in the future.
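
The ratio comparison could be sketched roughly as follows. The struct names, the function names and the example range are assumptions made purely for illustration; what counts as a "normal" range would have to be measured on real clusters.

```cpp
#include <cstdint>

// Hypothetical snapshot of two of the proposed metrics.
struct Snapshot
{
  std::int64_t actors;
  std::int64_t pending_messages;
};

// Hypothetical "normal" range for a ratio (inclusive on both ends).
struct Range
{
  double low;
  double high;
};

// Average number of pending messages per running actor.
// Returns 0.0 when there are no actors, to avoid dividing by zero.
inline double messagesPerActor(const Snapshot& s)
{
  if (s.actors <= 0) {
    return 0.0;
  }
  return static_cast<double>(s.pending_messages) /
         static_cast<double>(s.actors);
}

// True if the observed ratio lies inside the assumed normal range.
inline bool withinRange(double ratio, const Range& range)
{
  return ratio >= range.low && ratio <= range.high;
}
```

A performance regression test could, for example, fail when `withinRange(messagesPerActor(snapshot), range)` is false for a range recorded from a known-good build.
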
> Libprocess internal state is not monitored by metrics.
> ------------------------------------------------------
>
> Key: MESOS-7819
> URL: https://issues.apache.org/jira/browse/MESOS-7819
> Project: Mesos
> Issue Type: Improvement
> Components: libprocess
> Reporter: Alexander Rukletsov
> Labels: metrics, newbie++
>
> Libprocess does not expose its internal state via metrics. Active sockets,
> number of HTTP proxies, number of running actors, number of pending messages
> for all active sockets, etc — may be of interest when monitoring and
> debugging Mesos clusters.