Ottomata has submitted this change and it was merged. Change subject: Re-enable varnishkafka delivery error alerts ......................................................................
Re-enable varnishkafka delivery error alerts This re-enables the icinga alerts for varnishkafka delivery errors that was disabled in https://gerrit.wikimedia.org/r/#/c/138302/1. The alert thresholds have been adjusted, and Kafka has been mostly good since a cluster reinstall on June 24th. Change-Id: Ia7b206c429a50678c46e30c2d93ccd835f5eff62 --- M manifests/role/cache.pp 1 file changed, 11 insertions(+), 0 deletions(-) Approvals: Ottomata: Verified; Looks good to me, approved jenkins-bot: Verified diff --git a/manifests/role/cache.pp b/manifests/role/cache.pp index 5505f69..23ef07a 100644 --- a/manifests/role/cache.pp +++ b/manifests/role/cache.pp @@ -528,6 +528,17 @@ nrpe_command => '/usr/lib/nagios/plugins/check_procs -c 1:1 -C varnishkafka', require => Class['::varnishkafka'], } + + # Generate an alert if too many delivery report errors + monitor_ganglia { 'varnishkafka-drerr': + description => 'Varnishkafka Delivery Errors', + metric => 'kafka.varnishkafka.kafka_drerr.per_second', + # Warn if between more than 0 but less than 30 + warning => '0.1:29.9', + # Critical if greater than 30. + critical => '30.0', + require => Class['::varnishkafka::monitoring'], + } } } -- To view, visit https://gerrit.wikimedia.org/r/150010 To unsubscribe, visit https://gerrit.wikimedia.org/r/settings Gerrit-MessageType: merged Gerrit-Change-Id: Ia7b206c429a50678c46e30c2d93ccd835f5eff62 Gerrit-PatchSet: 1 Gerrit-Project: operations/puppet Gerrit-Branch: production Gerrit-Owner: Ottomata <[email protected]> Gerrit-Reviewer: Ottomata <[email protected]> Gerrit-Reviewer: jenkins-bot <> _______________________________________________ MediaWiki-commits mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/mediawiki-commits
