Ottomata has uploaded a new change for review.

  https://gerrit.wikimedia.org/r/150010

Change subject: Re-enable varnishkafka delivery error alerts
......................................................................

Re-enable varnishkafka delivery error alerts

This re-enables the icinga alerts for varnishkafka delivery errors
that were disabled in https://gerrit.wikimedia.org/r/#/c/138302/1.
The alert thresholds have been adjusted, and Kafka has been mostly
good since a cluster reinstall on June 24th.

Change-Id: Ia7b206c429a50678c46e30c2d93ccd835f5eff62
---
M manifests/role/cache.pp
1 file changed, 11 insertions(+), 0 deletions(-)

  git pull ssh://gerrit.wikimedia.org:29418/operations/puppet refs/changes/10/150010/1

diff --git a/manifests/role/cache.pp b/manifests/role/cache.pp
index 5505f69..23ef07a 100644
--- a/manifests/role/cache.pp
+++ b/manifests/role/cache.pp
@@ -528,6 +528,17 @@
             nrpe_command => '/usr/lib/nagios/plugins/check_procs -c 1:1 -C varnishkafka',
             require      => Class['::varnishkafka'],
         }
+
+        # Generate an alert if too many delivery report errors
+        monitor_ganglia { 'varnishkafka-drerr':
+            description => 'Varnishkafka Delivery Errors',
+            metric      => 'kafka.varnishkafka.kafka_drerr.per_second',
+            # Warn if between more than 0 but less than 30
+            warning     => '0.1:29.9',
+            # Critical if greater than 30.
+            critical    => '30.0',
+            require     => Class['::varnishkafka::monitoring'],
+        }
     }
 }
 

-- 
To view, visit https://gerrit.wikimedia.org/r/150010
To unsubscribe, visit https://gerrit.wikimedia.org/r/settings

Gerrit-MessageType: newchange
Gerrit-Change-Id: Ia7b206c429a50678c46e30c2d93ccd835f5eff62
Gerrit-PatchSet: 1
Gerrit-Project: operations/puppet
Gerrit-Branch: production
Gerrit-Owner: Ottomata <[email protected]>

_______________________________________________
MediaWiki-commits mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/mediawiki-commits
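
For readers checking the new thresholds, here is a minimal sketch of the behaviour the comments in the patch describe: warn for a delivery-error rate between 0.1 and 29.9 per second, go critical above 30. It follows the comments only. The function name `classify_drerr_rate` and the constants are hypothetical, and the actual check behind `monitor_ganglia` may parse the `warning` and `critical` range strings differently.

```python
# Thresholds copied from the patch. How they are interpreted here is an
# assumption taken from the patch comments, not from the real check plugin.
WARNING_RANGE = (0.1, 29.9)   # warning  => '0.1:29.9'
CRITICAL_ABOVE = 30.0         # critical => '30.0'


def classify_drerr_rate(per_second: float) -> str:
    """Map a kafka_drerr.per_second value to an alert state (hypothetical helper)."""
    # "Critical if greater than 30."
    if per_second > CRITICAL_ABOVE:
        return "CRITICAL"
    # "Warn if between more than 0 but less than 30"
    low, high = WARNING_RANGE
    if low <= per_second <= high:
        return "WARNING"
    return "OK"


if __name__ == "__main__":
    for rate in (0.0, 5.0, 45.0):
        print(rate, classify_drerr_rate(rate))
```

Read literally, rates below 0.1 or in the narrow gap between 29.9 and 30.0 are neither warning nor critical in this sketch.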
