Elukey has uploaded a new change for review. ( https://gerrit.wikimedia.org/r/375957 )
Change subject: role::analytics_cluster::hadoop::master: raise HDFS alarms thresholds
......................................................................
role::analytics_cluster::hadoop::master: raise HDFS alarms thresholds
A warning threshold of 70% is too conservative: there are legitimate
use cases in which HDFS usage needs to reach that level without
triggering any alarm. Raise the warning threshold to 85% and the
critical threshold to 90%.
Change-Id: I2b9212a56b38ad9b4e31aae9b9bbc29ba409287d
---
M modules/role/manifests/analytics_cluster/hadoop/master.pp
1 file changed, 2 insertions(+), 2 deletions(-)
git pull ssh://gerrit.wikimedia.org:29418/operations/puppet refs/changes/57/375957/1
diff --git a/modules/role/manifests/analytics_cluster/hadoop/master.pp b/modules/role/manifests/analytics_cluster/hadoop/master.pp
index e3e732a..954ddc7 100644
--- a/modules/role/manifests/analytics_cluster/hadoop/master.pp
+++ b/modules/role/manifests/analytics_cluster/hadoop/master.pp
@@ -117,8 +117,8 @@
description => 'HDFS capacity used percentage',
metric => "Hadoop.NameNode.${::hostname}_eqiad_wmnet_9980.Hadoop.NameNode.NameNodeInfo.PercentUsed.mean",
from => '30min',
- warning => 70,
- critical => 80,
+ warning => 85,
+ critical => 90,
percentage => '60',
contact_group => 'analytics',
}
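
To illustrate the effect of the new thresholds, here is a minimal Python sketch of how such a check might be evaluated. It assumes (this is not confirmed by the change itself) that `percentage` is the share of datapoints in the `from` window that must reach a threshold before the alert fires. The function name `classify` and the sample values are hypothetical.

```python
from typing import Optional, Sequence


def classify(
    samples: Sequence[Optional[float]],
    warning: float,
    critical: float,
    percentage: float,
) -> str:
    """Return OK, WARNING, CRITICAL or UNKNOWN for a window of datapoints.

    Assumption: an alert level is reached when at least `percentage`
    percent of the non-null datapoints are at or above its threshold.
    """
    # Drop missing datapoints (represented here as None).
    values = [s for s in samples if s is not None]
    if not values:
        return "UNKNOWN"

    def share_at_or_above(threshold: float) -> float:
        hits = sum(1 for v in values if v >= threshold)
        return 100.0 * hits / len(values)

    # Check the more severe level first.
    if share_at_or_above(critical) >= percentage:
        return "CRITICAL"
    if share_at_or_above(warning) >= percentage:
        return "WARNING"
    return "OK"


# Hypothetical 30-minute window of PercentUsed values.
window = [72.0, 74.0, 76.0, 78.0, 75.0]
print(classify(window, warning=70, critical=80, percentage=60))  # old thresholds
print(classify(window, warning=85, critical=90, percentage=60))  # new thresholds
```

Under these assumptions, a cluster sitting in the mid-70% range would trigger a warning with the old values and stay quiet with the new ones, which is the behaviour the commit message asks for.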
--
To view, visit https://gerrit.wikimedia.org/r/375957
Gerrit-MessageType: newchange
Gerrit-Change-Id: I2b9212a56b38ad9b4e31aae9b9bbc29ba409287d
Gerrit-PatchSet: 1
Gerrit-Project: operations/puppet
Gerrit-Branch: production
Gerrit-Owner: Elukey <[email protected]>