[Betacluster-alerts] ** RECOVERY alert - deployment-fluorine02/Free space - all mounts is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: OK Date/Time: Thu 18 Jan 07:12:40 UTC 2018 Notes URLs: Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka01/Free space - all mounts is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-kafka01 Address: 10.68.21.219 State: OK Date/Time: Thu 18 Jan 07:09:43 UTC 2018 Notes URLs: Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki05/Free space - all mounts is WARNING **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-mediawiki05 Address: 10.68.22.21 State: WARNING Date/Time: Thu 18 Jan 07:04:47 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-mediawiki05.diskspace.root.byte_percentfree (<100.00%)

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki05/Free space - all mounts is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-mediawiki05 Address: 10.68.22.21 State: CRITICAL Date/Time: Thu 18 Jan 06:54:48 UTC 2018 Notes URLs: Additional Info: CRITICAL: deployment-prep.deployment-mediawiki05.diskspace.root.byte_percentfree (<11.11%)

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki05/App Server Main HTTP Response is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki05 Address: 10.68.22.21 State: OK Date/Time: Thu 18 Jan 01:15:07 UTC 2018 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 46843 bytes in 4.788 second response time

[Betacluster-alerts] ** RECOVERY alert - Generic Beta Cluster/English Wikipedia Mobile Main page is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: OK Date/Time: Thu 18 Jan 01:14:42 UTC 2018 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 35873 bytes in 0.723 second response time

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki04/App Server Main HTTP Response is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki04 Address: 10.68.19.128 State: OK Date/Time: Thu 18 Jan 01:14:21 UTC 2018 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 46797 bytes in 0.663 second response time

[Betacluster-alerts] ** RECOVERY alert - Generic Beta Cluster/English Wikipedia Main page is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: English Wikipedia Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: OK Date/Time: Thu 18 Jan 01:14:07 UTC 2018 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 47430 bytes in 0.648 second response time

[Betacluster-alerts] ** PROBLEM alert - deployment-puppetmaster02/Long lived cherry-picks on puppetmaster is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Long lived cherry-picks on puppetmaster Host: deployment-puppetmaster02 Address: 10.68.21.200 State: CRITICAL Date/Time: Thu 18 Jan 00:31:15 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-imagescaler02/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-imagescaler02 Address: 10.68.18.233 State: OK Date/Time: Wed 17 Jan 23:20:23 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka-jumbo-2/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-kafka-jumbo-2 Address: 10.68.16.87 State: OK Date/Time: Wed 17 Jan 23:19:49 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-sca01/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-sca01 Address: 10.68.20.183 State: OK Date/Time: Wed 17 Jan 23:18:50 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-sca02/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-sca02 Address: 10.68.20.153 State: OK Date/Time: Wed 17 Jan 23:15:46 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-mira/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mira Address: 10.68.20.135 State: OK Date/Time: Wed 17 Jan 22:52:05 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-db04/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-db04 Address: 10.68.18.35 State: OK Date/Time: Wed 17 Jan 22:52:07 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-memc06/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-memc06 Address: 10.68.22.239 State: OK Date/Time: Wed 17 Jan 22:49:29 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka05/Puppet staleness is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-kafka05 Address: 10.68.21.106 State: OK Date/Time: Wed 17 Jan 22:48:32 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [3600.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-elastic05/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-elastic05 Address: 10.68.20.21 State: OK Date/Time: Wed 17 Jan 22:48:12 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-changeprop/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-changeprop Address: 10.68.16.88 State: OK Date/Time: Wed 17 Jan 22:48:43 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-cassandra3-01/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cassandra3-01 Address: 10.68.17.103 State: OK Date/Time: Wed 17 Jan 22:46:54 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-cumin/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cumin Address: 10.68.21.105 State: OK Date/Time: Wed 17 Jan 22:47:02 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki07/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mediawiki07 Address: 10.68.17.40 State: OK Date/Time: Wed 17 Jan 22:46:01 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-etcd-01/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-etcd-01 Address: 10.68.19.227 State: OK Date/Time: Wed 17 Jan 22:45:59 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-ms-be04/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-ms-be04 Address: 10.68.16.139 State: OK Date/Time: Wed 17 Jan 22:45:42 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-cpjobqueue/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cpjobqueue Address: 10.68.22.161 State: OK Date/Time: Wed 17 Jan 22:45:51 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-aqs01/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-aqs01 Address: 10.68.18.237 State: OK Date/Time: Wed 17 Jan 22:44:57 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-cache-text04/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cache-text04 Address: 10.68.18.103 State: OK Date/Time: Wed 17 Jan 22:44:42 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-parsoid09/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-parsoid09 Address: 10.68.20.142 State: OK Date/Time: Wed 17 Jan 22:43:50 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-redis05/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-redis05 Address: 10.68.19.242 State: OK Date/Time: Wed 17 Jan 22:43:20 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-eventlogging04/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-eventlogging04 Address: 10.68.23.204 State: OK Date/Time: Wed 17 Jan 22:43:25 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-videoscaler01/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-videoscaler01 Address: 10.68.19.130 State: OK Date/Time: Wed 17 Jan 22:41:59 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-memc07/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-memc07 Address: 10.68.17.171 State: OK Date/Time: Wed 17 Jan 22:40:12 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-redis06/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-redis06 Address: 10.68.20.16 State: OK Date/Time: Wed 17 Jan 22:39:17 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-puppetmaster02/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-puppetmaster02 Address: 10.68.21.200 State: OK Date/Time: Wed 17 Jan 22:38:37 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka05/Puppet staleness is WARNING **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-kafka05 Address: 10.68.21.106 State: WARNING Date/Time: Wed 17 Jan 22:38:32 UTC 2018 Notes URLs: Additional Info: WARNING: 10.00% of data above the warning threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka-jumbo-2/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-kafka-jumbo-2 Address: 10.68.16.87 State: CRITICAL Date/Time: Wed 17 Jan 22:14:50 UTC 2018 Notes URLs: Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-sca01/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-sca01 Address: 10.68.20.183 State: CRITICAL Date/Time: Wed 17 Jan 22:13:51 UTC 2018 Notes URLs: Additional Info: CRITICAL: 55.56% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-puppetmaster02/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-puppetmaster02 Address: 10.68.21.200 State: CRITICAL Date/Time: Wed 17 Jan 22:13:37 UTC 2018 Notes URLs: Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-elastic05/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-elastic05 Address: 10.68.20.21 State: CRITICAL Date/Time: Wed 17 Jan 22:13:13 UTC 2018 Notes URLs: Additional Info: CRITICAL: 55.56% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-db04/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-db04 Address: 10.68.18.35 State: CRITICAL Date/Time: Wed 17 Jan 22:12:05 UTC 2018 Notes URLs: Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mira/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mira Address: 10.68.20.135 State: CRITICAL Date/Time: Wed 17 Jan 22:12:03 UTC 2018 Notes URLs: Additional Info: CRITICAL: 22.22% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-ms-be04/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-ms-be04 Address: 10.68.16.139 State: CRITICAL Date/Time: Wed 17 Jan 22:10:40 UTC 2018 Notes URLs: Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-imagescaler02/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-imagescaler02 Address: 10.68.18.233 State: CRITICAL Date/Time: Wed 17 Jan 22:10:22 UTC 2018 Notes URLs: Additional Info: CRITICAL: 22.22% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-aqs01/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-aqs01 Address: 10.68.18.237 State: CRITICAL Date/Time: Wed 17 Jan 22:09:54 UTC 2018 Notes URLs: Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-memc06/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-memc06 Address: 10.68.22.239 State: CRITICAL Date/Time: Wed 17 Jan 22:09:29 UTC 2018 Notes URLs: Additional Info: CRITICAL: 22.22% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-changeprop/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-changeprop Address: 10.68.16.88 State: CRITICAL Date/Time: Wed 17 Jan 22:08:43 UTC 2018 Notes URLs: Additional Info: CRITICAL: 40.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-parsoid09/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-parsoid09 Address: 10.68.20.142 State: CRITICAL Date/Time: Wed 17 Jan 22:08:53 UTC 2018 Notes URLs: Additional Info: CRITICAL: 66.67% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-redis05/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-redis05 Address: 10.68.19.242 State: CRITICAL Date/Time: Wed 17 Jan 22:08:22 UTC 2018 Notes URLs: Additional Info: CRITICAL: 55.56% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki06/App Server Main HTTP Response is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki06 Address: 10.68.19.241 State: OK Date/Time: Wed 17 Jan 22:07:14 UTC 2018 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 46843 bytes in 2.750 second response time

[Betacluster-alerts] ** PROBLEM alert - deployment-cumin/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cumin Address: 10.68.21.105 State: CRITICAL Date/Time: Wed 17 Jan 22:07:02 UTC 2018 Notes URLs: Additional Info: CRITICAL: 11.11% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-cassandra3-01/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cassandra3-01 Address: 10.68.17.103 State: CRITICAL Date/Time: Wed 17 Jan 22:06:53 UTC 2018 Notes URLs: Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-videoscaler01/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-videoscaler01 Address: 10.68.19.130 State: CRITICAL Date/Time: Wed 17 Jan 22:07:00 UTC 2018 Notes URLs: Additional Info: CRITICAL: 37.50% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki07/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mediawiki07 Address: 10.68.17.40 State: CRITICAL Date/Time: Wed 17 Jan 22:06:00 UTC 2018 Notes URLs: Additional Info: CRITICAL: 22.22% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-etcd-01/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-etcd-01 Address: 10.68.19.227 State: CRITICAL Date/Time: Wed 17 Jan 22:05:58 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-cpjobqueue/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cpjobqueue Address: 10.68.22.161 State: CRITICAL Date/Time: Wed 17 Jan 22:05:50 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-sca02/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-sca02 Address: 10.68.20.153 State: CRITICAL Date/Time: Wed 17 Jan 22:05:48 UTC 2018 Notes URLs: Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-memc07/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-memc07 Address: 10.68.17.171 State: CRITICAL Date/Time: Wed 17 Jan 22:05:14 UTC 2018 Notes URLs: Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-cache-text04/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cache-text04 Address: 10.68.18.103 State: CRITICAL Date/Time: Wed 17 Jan 22:04:43 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-redis06/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-redis06 Address: 10.68.20.16 State: CRITICAL Date/Time: Wed 17 Jan 22:04:17 UTC 2018 Notes URLs: Additional Info: CRITICAL: 44.44% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-eventlogging04/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-eventlogging04 Address: 10.68.23.204 State: CRITICAL Date/Time: Wed 17 Jan 22:03:24 UTC 2018 Notes URLs: Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki06/App Server Main HTTP Response is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki06 Address: 10.68.19.241 State: CRITICAL Date/Time: Wed 17 Jan 22:02:21 UTC 2018 Notes URLs: Additional Info: CRITICAL - Socket timeout after 10 seconds ___

[Betacluster-alerts] ** PROBLEM alert - deployment-fluorine02/Free space - all mounts is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: CRITICAL Date/Time: Wed 17 Jan 21:57:37 UTC 2018 Notes URLs: Additional Info: CRITICAL: deployment-prep.deployment-fluorine02.diskspace._srv.byte_percentfree (<11.11%)

[Betacluster-alerts] ** RECOVERY alert - deployment-poolcounter04/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-poolcounter04 Address: 10.68.17.48 State: OK Date/Time: Wed 17 Jan 20:37:52 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-sca03/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-sca03 Address: 10.68.21.183 State: OK Date/Time: Wed 17 Jan 20:37:08 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-fluorine02/Free space - all mounts is WARNING **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: WARNING Date/Time: Wed 17 Jan 20:17:38 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-fluorine02.diskspace._srv.byte_percentfree (<22.22%)

[Betacluster-alerts] ** PROBLEM alert - deployment-poolcounter04/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-poolcounter04 Address: 10.68.17.48 State: CRITICAL Date/Time: Wed 17 Jan 20:02:55 UTC 2018 Notes URLs: Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-sca03/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-sca03 Address: 10.68.21.183 State: CRITICAL Date/Time: Wed 17 Jan 20:02:07 UTC 2018 Notes URLs: Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0] ___

[Betacluster-alerts] Host DOWN alert for deployment-netbox!

2018-01-17 Thread shinken
Notification Type: PROBLEM Host: deployment-netbox State: DOWN Address: 10.68.19.203 Info: CRITICAL - Host Unreachable (10.68.19.203) Date/Time: Wed 17 Jan 20:01:06 UTC 2018 ___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** RECOVERY alert - deployment-changeprop/Puppet staleness is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-changeprop Address: 10.68.16.88 State: OK Date/Time: Wed 17 Jan 18:47:16 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mx/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mx Address: 10.68.17.78 State: CRITICAL Date/Time: Wed 17 Jan 18:39:21 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-changeprop/Puppet staleness is WARNING **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-changeprop Address: 10.68.16.88 State: WARNING Date/Time: Wed 17 Jan 18:07:14 UTC 2018 Notes URLs: Additional Info: WARNING: 22.22% of data above the warning threshold [3600.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-cache-text04/Puppet errors is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cache-text04 Address: 10.68.18.103 State: OK Date/Time: Wed 17 Jan 18:03:43 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Mobile Main page is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Wed 17 Jan 17:24:46 UTC 2018 Notes URLs: Additional Info: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - string 'Wikipedia'

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki04/App Server Main HTTP Response is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki04 Address: 10.68.19.128 State: CRITICAL Date/Time: Wed 17 Jan 17:24:21 UTC 2018 Notes URLs: Additional Info: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - string 'Wikipedia' not found on

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Main page is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Wed 17 Jan 17:24:07 UTC 2018 Notes URLs: Additional Info: HTTP CRITICAL: HTTP/1.1 503 Service Unavailable - string 'Wikipedia' not

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki05/Free space - all mounts is WARNING **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-mediawiki05 Address: 10.68.22.21 State: WARNING Date/Time: Wed 17 Jan 16:24:47 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-mediawiki05.diskspace.root.byte_percentfree (<100.00%)

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki05/Free space - all mounts is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-mediawiki05 Address: 10.68.22.21 State: OK Date/Time: Wed 17 Jan 16:18:47 UTC 2018 Notes URLs: Additional Info: OK: deployment-prep.deployment-mediawiki05.diskspace.root.byte_percentfree (More than half of the

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Wed 17 Jan 15:58:29 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 17 Jan 15:53:29 UTC 2018 Notes URLs: Additional Info: WARNING: 20.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-redis02/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-redis02 Address: 10.68.16.231 State: CRITICAL Date/Time: Wed 17 Jan 15:33:41 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-redis01/Puppet errors is CRITICAL **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-redis01 Address: 10.68.16.177 State: CRITICAL Date/Time: Wed 17 Jan 15:27:43 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki05/Free space - all mounts is WARNING **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-mediawiki05 Address: 10.68.22.21 State: WARNING Date/Time: Wed 17 Jan 14:53:46 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-mediawiki05.diskspace.root.byte_percentfree (<33.33%)

[Betacluster-alerts] ** RECOVERY alert - deployment-tin/Free space - all mounts is OK **

2018-01-17 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-tin Address: 10.68.21.205 State: OK Date/Time: Wed 17 Jan 08:53:41 UTC 2018 Notes URLs: Additional Info: OK: deployment-prep.deployment-tin.diskspace._mnt.byte_percentfree (No valid datapoints found)

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Free space - all mounts is WARNING **

2018-01-17 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-tin Address: 10.68.21.205 State: WARNING Date/Time: Wed 17 Jan 08:43:39 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-tin.diskspace._mnt.byte_percentfree (No valid datapoints