Filippo Giunchedi has uploaded a new change for review.

  https://gerrit.wikimedia.org/r/310539

Change subject: thumbor: increase icinga retries for service units
......................................................................

thumbor: increase icinga retries for service units

Failures due to OOM are usually legitimate, but the units will recover by
themselves under normal circumstances since Restart=always. Notify only in
cases where thumbor has likely failed to start up at all.

Bug: T145623
Change-Id: I52ad9b39b3363a64b5191f383c494465d74d749e
---
M modules/thumbor/manifests/instance.pp
1 file changed, 3 insertions(+), 1 deletion(-)


  git pull ssh://gerrit.wikimedia.org:29418/operations/puppet refs/changes/39/310539/1

diff --git a/modules/thumbor/manifests/instance.pp b/modules/thumbor/manifests/instance.pp
index 3c1a052..85ceafb 100644
--- a/modules/thumbor/manifests/instance.pp
+++ b/modules/thumbor/manifests/instance.pp
@@ -30,5 +30,7 @@
         require  => File[$instance_service_path],
     }
 
-    nrpe::monitor_systemd_unit_state{ "thumbor@${port}": }
+    nrpe::monitor_systemd_unit_state{ "thumbor@${port}":
+        retries => 15,
+    }
 }
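
The effect of a higher retry count can be sketched with a small simulation.
This is an illustration only, under two assumptions not stated in the change:
that `retries` maps to the Icinga notion of consecutive failed checks needed
before a notification is sent, and that a passing check resets that count.
The function name, the timelines, and the comparison value of 3 are all
hypothetical.

```python
def first_notification(check_results, max_attempts):
    """Return the index of the check that would trigger a notification, or None.

    check_results: iterable of booleans, True meaning the check returned OK.
    max_attempts: consecutive non-OK results needed before notifying
                  (assumed here to correspond to the `retries` parameter).
    """
    consecutive_failures = 0
    for index, ok in enumerate(check_results):
        if ok:
            consecutive_failures = 0  # a recovery resets the failure count
            continue
        consecutive_failures += 1
        if consecutive_failures >= max_attempts:
            return index
    return None


# Hypothetical timelines. In the first, the unit is seen as failed for a few
# checks after an OOM kill, then systemd restarts it (Restart=always).
# In the second, the unit never comes back.
oom_blip = [True, False, False, False, True, True]
never_starts = [True] + [False] * 20

print(first_notification(oom_blip, 3))       # 3: notifies during the blip
print(first_notification(oom_blip, 15))      # None: the blip is absorbed
print(first_notification(never_starts, 15))  # 15: notifies on persistent failure
```

With the larger value, a short failure that clears on restart never reaches
the threshold, while a unit that stays down still produces a notification,
only later.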

-- 
To view, visit https://gerrit.wikimedia.org/r/310539
To unsubscribe, visit https://gerrit.wikimedia.org/r/settings

Gerrit-MessageType: newchange
Gerrit-Change-Id: I52ad9b39b3363a64b5191f383c494465d74d749e
Gerrit-PatchSet: 1
Gerrit-Project: operations/puppet
Gerrit-Branch: production
Gerrit-Owner: Filippo Giunchedi <fgiunch...@wikimedia.org>

_______________________________________________
MediaWiki-commits mailing list
MediaWiki-commits@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/mediawiki-commits
