Filippo Giunchedi has uploaded a new change for review. https://gerrit.wikimedia.org/r/310539
Change subject: thumbor: increase icinga retries for service units ...................................................................... thumbor: increase icinga retries for service units Failures for OOM are usually legit, though the unit will recover by themselves since Restart=always under normal circumstances. Notify only in cases where thumbor likely has failed to start up at all. Bug: T145623 Change-Id: I52ad9b39b3363a64b5191f383c494465d74d749e --- M modules/thumbor/manifests/instance.pp 1 file changed, 3 insertions(+), 1 deletion(-) git pull ssh://gerrit.wikimedia.org:29418/operations/puppet refs/changes/39/310539/1 diff --git a/modules/thumbor/manifests/instance.pp b/modules/thumbor/manifests/instance.pp index 3c1a052..85ceafb 100644 --- a/modules/thumbor/manifests/instance.pp +++ b/modules/thumbor/manifests/instance.pp @@ -30,5 +30,7 @@ require => File[$instance_service_path], } - nrpe::monitor_systemd_unit_state{ "thumbor@${port}": } + nrpe::monitor_systemd_unit_state{ "thumbor@${port}": + retries => 15, + } } -- To view, visit https://gerrit.wikimedia.org/r/310539 To unsubscribe, visit https://gerrit.wikimedia.org/r/settings Gerrit-MessageType: newchange Gerrit-Change-Id: I52ad9b39b3363a64b5191f383c494465d74d749e Gerrit-PatchSet: 1 Gerrit-Project: operations/puppet Gerrit-Branch: production Gerrit-Owner: Filippo Giunchedi <fgiunch...@wikimedia.org> _______________________________________________ MediaWiki-commits mailing list MediaWiki-commits@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-commits