[GitHub] [pulsar] sijie commented on issue #6518: health monitoring, alarms

2020-03-26 Thread GitBox
sijie commented on issue #6518: health monitoring, alarms URL: https://github.com/apache/pulsar/issues/6518#issuecomment-604309990 @ilyam8 it is just an example for your reference. For most of the people, they define their alerting rules based on the metrics you can find on

[GitHub] [pulsar] sijie commented on issue #6518: health monitoring, alarms

2020-03-26 Thread GitBox
sijie commented on issue #6518: health monitoring, alarms URL: https://github.com/apache/pulsar/issues/6518#issuecomment-604248488 @ilyam8 the metrics are documented here https://pulsar.apache.org/docs/en/reference-metrics/. the most common metrics to alert on are "message backlog" and