[Betacluster-alerts] ** PROBLEM alert - deployment-elastic06/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-elastic06 Address: 10.68.23.242 State: CRITICAL Date/Time: Wed 24 May 05:36:06 UTC 2017 Additional Info: CRITICAL: 66.67% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-elastic05/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-elastic05 Address: 10.68.20.21 State: CRITICAL Date/Time: Wed 24 May 05:11:45 UTC 2017 Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-mcs01/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mcs01 Address: 10.68.17.18 State: OK Date/Time: Wed 24 May 04:18:09 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-mira/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mira Address: 10.68.20.135 State: OK Date/Time: Wed 24 May 04:05:58 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-mcs01/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mcs01 Address: 10.68.17.18 State: CRITICAL Date/Time: Wed 24 May 03:43:08 UTC 2017 Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-urldownloader/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-urldownloader Address: 10.68.16.135 State: OK Date/Time: Wed 24 May 03:30:41 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** PROBLEM alert - deployment-mira/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mira Address: 10.68.20.135 State: CRITICAL Date/Time: Wed 24 May 03:25:57 UTC 2017 Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-logstash2/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-logstash2 Address: 10.68.16.147 State: OK Date/Time: Wed 24 May 03:20:48 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-elastic05/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-elastic05 Address: 10.68.20.21 State: OK Date/Time: Wed 24 May 03:20:43 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-conf03/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-conf03 Address: 10.68.20.134 State: OK Date/Time: Wed 24 May 03:11:46 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-urldownloader/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-urldownloader Address: 10.68.16.135 State: CRITICAL Date/Time: Wed 24 May 02:55:43 UTC 2017 Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka01/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-kafka01 Address: 10.68.21.219 State: CRITICAL Date/Time: Wed 24 May 02:56:06 UTC 2017 Additional Info: CRITICAL: 66.67% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-logstash2/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-logstash2 Address: 10.68.16.147 State: CRITICAL Date/Time: Wed 24 May 02:45:47 UTC 2017 Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-aqs01/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-aqs01 Address: 10.68.18.237 State: OK Date/Time: Wed 24 May 02:41:27 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-conf03/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-conf03 Address: 10.68.20.134 State: CRITICAL Date/Time: Wed 24 May 02:31:46 UTC 2017 Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Wed 24 May 02:27:57 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [1.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 24 May 02:22:57 UTC 2017 Additional Info: WARNING: 40.00% of data above the warning threshold [1.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mathoid Address: 10.68.23.236 State: OK Date/Time: Wed 24 May 02:21:54 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-logstash2/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-logstash2 Address: 10.68.16.147 State: OK Date/Time: Wed 24 May 02:19:48 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-aqs01/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-aqs01 Address: 10.68.18.237 State: CRITICAL Date/Time: Wed 24 May 02:06:25 UTC 2017 Additional Info: CRITICAL: 66.67% of data above the critical threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-restbase02/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-restbase02 Address: 10.68.17.189 State: CRITICAL Date/Time: Wed 24 May 02:05:20 UTC 2017 Additional Info: CRITICAL: 55.56% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-logstash2/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-logstash2 Address: 10.68.16.147 State: CRITICAL Date/Time: Wed 24 May 01:44:47 UTC 2017 Additional Info: CRITICAL: 44.44% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mathoid Address: 10.68.23.236 State: CRITICAL Date/Time: Wed 24 May 01:41:54 UTC 2017 Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-ores-redis-01/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-ores-redis-01 Address: 10.68.22.248 State: OK Date/Time: Wed 24 May 01:33:56 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** PROBLEM alert - deployment-ores-redis-01/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-ores-redis-01 Address: 10.68.22.248 State: CRITICAL Date/Time: Wed 24 May 00:53:55 UTC 2017 Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Tue 23 May 23:41:57 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [1.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Tue 23 May 23:16:57 UTC 2017 Additional Info: WARNING: 20.00% of data above the warning threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-prometheus01/Puppet staleness is WARNING **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-prometheus01 Address: 10.68.20.247 State: WARNING Date/Time: Tue 23 May 22:22:56 UTC 2017 Additional Info: WARNING: 20.00% of data above the warning threshold [3600.0] ___

[Betacluster-alerts] ** RECOVERY alert - Generic Beta Cluster/English Wikipedia Main page is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: English Wikipedia Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: OK Date/Time: Tue 23 May 22:21:22 UTC 2017 Additional Info: HTTP OK: HTTP/1.1 200 OK - 47136 bytes in 0.912 second response time

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki04/App Server Main HTTP Response is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki04 Address: 10.68.19.128 State: OK Date/Time: Tue 23 May 22:21:16 UTC 2017 Additional Info: HTTP OK: HTTP/1.1 200 OK - 46573 bytes in 1.000 second response time

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Main page is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Tue 23 May 22:16:18 UTC 2017 Additional Info: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 2156 bytes in 0.099 second response

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki04/App Server Main HTTP Response is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki04 Address: 10.68.19.128 State: CRITICAL Date/Time: Tue 23 May 22:16:15 UTC 2017 Additional Info: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 1592 bytes in 0.098 second response time

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka01/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-kafka01 Address: 10.68.21.219 State: OK Date/Time: Tue 23 May 21:30:07 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka01/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-kafka01 Address: 10.68.21.219 State: CRITICAL Date/Time: Tue 23 May 20:55:06 UTC 2017 Additional Info: CRITICAL: 44.44% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-db03/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-db03 Address: 10.68.23.30 State: OK Date/Time: Tue 23 May 20:37:02 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-db03/Puppet staleness is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-db03 Address: 10.68.23.30 State: OK Date/Time: Tue 23 May 20:33:37 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [3600.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-db03/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-db03 Address: 10.68.23.30 State: CRITICAL Date/Time: Tue 23 May 20:22:01 UTC 2017 Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki06/App Server Main HTTP Response is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki06 Address: 10.68.19.241 State: OK Date/Time: Tue 23 May 20:21:17 UTC 2017 Additional Info: HTTP OK: HTTP/1.1 200 OK - 46525 bytes in 2.305 second response time

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki05/App Server Main HTTP Response is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki05 Address: 10.68.22.21 State: OK Date/Time: Tue 23 May 20:21:19 UTC 2017 Additional Info: HTTP OK: HTTP/1.1 200 OK - 46525 bytes in 3.171 second response time

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki04/App Server Main HTTP Response is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki04 Address: 10.68.19.128 State: OK Date/Time: Tue 23 May 20:19:16 UTC 2017 Additional Info: HTTP OK: HTTP/1.1 200 OK - 46499 bytes in 0.924 second response time

[Betacluster-alerts] ** RECOVERY alert - Generic Beta Cluster/English Wikipedia Main page is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: English Wikipedia Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: OK Date/Time: Tue 23 May 20:19:19 UTC 2017 Additional Info: HTTP OK: HTTP/1.1 200 OK - 47102 bytes in 0.822 second response time

[Betacluster-alerts] ** RECOVERY alert - deployment-db04/Puppet staleness is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-db04 Address: 10.68.18.35 State: OK Date/Time: Tue 23 May 20:18:58 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [3600.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-ores-redis-01/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-ores-redis-01 Address: 10.68.22.248 State: CRITICAL Date/Time: Tue 23 May 19:56:55 UTC 2017 Additional Info: CRITICAL: 66.67% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-fluorine02/Free space - all mounts is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: CRITICAL Date/Time: Tue 23 May 19:47:32 UTC 2017 Additional Info: CRITICAL: deployment-prep.deployment-fluorine02.diskspace._srv.byte_percentfree (<20.00%)

[Betacluster-alerts] ** PROBLEM alert - deployment-db04/Puppet staleness is WARNING **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-db04 Address: 10.68.18.35 State: WARNING Date/Time: Tue 23 May 19:43:56 UTC 2017 Additional Info: WARNING: 60.00% of data above the warning threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-db03/Puppet staleness is WARNING **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-db03 Address: 10.68.23.30 State: WARNING Date/Time: Tue 23 May 19:33:37 UTC 2017 Additional Info: WARNING: 44.44% of data above the warning threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-fluorine02/Free space - all mounts is WARNING **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: WARNING Date/Time: Tue 23 May 18:52:34 UTC 2017 Additional Info: WARNING: deployment-prep.deployment-fluorine02.diskspace._srv.byte_percentfree (<60.00%)

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki05/App Server Main HTTP Response is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki05 Address: 10.68.22.21 State: CRITICAL Date/Time: Tue 23 May 18:31:20 UTC 2017 Additional Info: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 1572 bytes in 1.395 second response time

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki06/App Server Main HTTP Response is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki06 Address: 10.68.19.241 State: CRITICAL Date/Time: Tue 23 May 18:31:15 UTC 2017 Additional Info: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 1572 bytes in 1.294 second response time

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki04/App Server Main HTTP Response is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki04 Address: 10.68.19.128 State: CRITICAL Date/Time: Tue 23 May 18:29:16 UTC 2017 Additional Info: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 1571 bytes in 0.371 second response time

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Main page is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Tue 23 May 18:29:21 UTC 2017 Additional Info: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 2135 bytes in 0.417 second response

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Mobile Main page is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Tue 23 May 18:28:15 UTC 2017 Additional Info: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 2135 bytes in 0.420 second

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mathoid Address: 10.68.23.236 State: OK Date/Time: Tue 23 May 18:20:53 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-urldownloader/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-urldownloader Address: 10.68.16.135 State: OK Date/Time: Tue 23 May 17:59:40 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mathoid Address: 10.68.23.236 State: CRITICAL Date/Time: Tue 23 May 17:45:54 UTC 2017 Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-prometheus01/Puppet staleness is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-prometheus01 Address: 10.68.20.247 State: OK Date/Time: Tue 23 May 17:26:55 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [3600.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-urldownloader/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-urldownloader Address: 10.68.16.135 State: CRITICAL Date/Time: Tue 23 May 17:24:41 UTC 2017 Additional Info: CRITICAL: 40.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Tue 23 May 17:10:57 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [1.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Tue 23 May 17:00:57 UTC 2017 Additional Info: WARNING: 60.00% of data above the warning threshold [1.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-restbase02/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-restbase02 Address: 10.68.17.189 State: OK Date/Time: Tue 23 May 16:09:21 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-restbase02/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-restbase02 Address: 10.68.17.189 State: CRITICAL Date/Time: Tue 23 May 15:34:22 UTC 2017 Additional Info: CRITICAL: 40.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mathoid Address: 10.68.23.236 State: OK Date/Time: Tue 23 May 15:24:54 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-elastic05/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-elastic05 Address: 10.68.20.21 State: OK Date/Time: Tue 23 May 15:19:46 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-aqs01/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-aqs01 Address: 10.68.18.237 State: OK Date/Time: Tue 23 May 15:09:26 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-eventlogging03/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-eventlogging03 Address: 10.68.18.111 State: OK Date/Time: Tue 23 May 15:04:49 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** PROBLEM alert - deployment-eventlogging03/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-eventlogging03 Address: 10.68.18.111 State: CRITICAL Date/Time: Tue 23 May 14:54:49 UTC 2017 Additional Info: CRITICAL: 11.11% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mathoid Address: 10.68.23.236 State: CRITICAL Date/Time: Tue 23 May 14:44:53 UTC 2017 Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-aqs01/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-aqs01 Address: 10.68.18.237 State: CRITICAL Date/Time: Tue 23 May 14:34:25 UTC 2017 Additional Info: CRITICAL: 40.00% of data above the critical threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka01/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-kafka01 Address: 10.68.21.219 State: OK Date/Time: Tue 23 May 14:34:05 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] Host DOWN alert for deployment-phab02!

2017-05-23 Thread shinken
Notification Type: PROBLEM Host: deployment-phab02 State: DOWN Address: 10.68.19.232 Info: CRITICAL - Host Unreachable (10.68.19.232) Date/Time: Tue 23 May 14:29:35 UTC 2017 ___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** PROBLEM alert - deployment-elastic05/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-elastic05 Address: 10.68.20.21 State: CRITICAL Date/Time: Tue 23 May 14:14:44 UTC 2017 Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-ores-redis-01/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-ores-redis-01 Address: 10.68.22.248 State: OK Date/Time: Tue 23 May 14:00:56 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** RECOVERY alert - deployment-mira/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mira Address: 10.68.20.135 State: OK Date/Time: Tue 23 May 13:34:58 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-ores-redis-01/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-ores-redis-01 Address: 10.68.22.248 State: CRITICAL Date/Time: Tue 23 May 13:25:55 UTC 2017 Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka01/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-kafka01 Address: 10.68.21.219 State: CRITICAL Date/Time: Tue 23 May 13:24:05 UTC 2017 Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-ores-redis-01/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-ores-redis-01 Address: 10.68.22.248 State: OK Date/Time: Tue 23 May 12:59:56 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** PROBLEM alert - deployment-ores-redis-01/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-ores-redis-01 Address: 10.68.22.248 State: CRITICAL Date/Time: Tue 23 May 12:24:56 UTC 2017 Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mira/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mira Address: 10.68.20.135 State: CRITICAL Date/Time: Tue 23 May 12:24:58 UTC 2017 Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-logstash2/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-logstash2 Address: 10.68.16.147 State: OK Date/Time: Tue 23 May 12:23:48 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-logstash2/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-logstash2 Address: 10.68.16.147 State: CRITICAL Date/Time: Tue 23 May 11:43:47 UTC 2017 Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - Generic Beta Cluster/English Wikipedia Mobile Main page is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: OK Date/Time: Tue 23 May 11:42:19 UTC 2017 Additional Info: HTTP OK: HTTP/1.1 200 OK - 33349 bytes in 4.859 second response time

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Mobile Main page is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Tue 23 May 11:37:15 UTC 2017 Additional Info: HTTP CRITICAL: HTTP/1.1 503 Service Unavailable - string 'Wikipedia' not found on

[Betacluster-alerts] ** RECOVERY alert - deployment-conf03/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-conf03 Address: 10.68.20.134 State: OK Date/Time: Tue 23 May 10:40:46 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-conf03/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-conf03 Address: 10.68.20.134 State: CRITICAL Date/Time: Tue 23 May 10:05:46 UTC 2017 Additional Info: CRITICAL: 55.56% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-ores-redis-01/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-ores-redis-01 Address: 10.68.22.248 State: OK Date/Time: Tue 23 May 10:03:57 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** RECOVERY alert - deployment-urldownloader/Puppet errors is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-urldownloader Address: 10.68.16.135 State: OK Date/Time: Tue 23 May 10:03:41 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] beta-scap-eqiad - Build # 156507 - Fixed!

2017-05-23 Thread jenkins-bot
beta-scap-eqiad - Build # 156507 - Fixed: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/156507/ to view the results.___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** PROBLEM alert - deployment-urldownloader/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-urldownloader Address: 10.68.16.135 State: CRITICAL Date/Time: Tue 23 May 09:23:42 UTC 2017 Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0] ___

[Betacluster-alerts] beta-scap-eqiad - Build # 156505 - Failure!

2017-05-23 Thread jenkins-bot
beta-scap-eqiad - Build # 156505 - Failure: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/156505/ to view the results.___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** PROBLEM alert - deployment-ores-redis-01/Puppet errors is CRITICAL **

2017-05-23 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-ores-redis-01 Address: 10.68.22.248 State: CRITICAL Date/Time: Tue 23 May 09:23:56 UTC 2017 Additional Info: CRITICAL: 40.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] beta-scap-eqiad - Build # 156504 - Aborted!

2017-05-23 Thread jenkins-bot
beta-scap-eqiad - Build # 156504 - Aborted: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/156504/ to view the results.___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** RECOVERY alert - deployment-fluorine02/Free space - all mounts is OK **

2017-05-23 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: OK Date/Time: Tue 23 May 07:11:31 UTC 2017 Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list