[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Thu 04 Jan 07:58:50 UTC 2018 Notes URLs: Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Thu 04 Jan 07:37:51 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka03/Puppet staleness is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-kafka03 Address: 10.68.16.138 State: OK Date/Time: Thu 04 Jan 07:34:54 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-secureredirexperiment/Puppet staleness is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-secureredirexperiment Address: 10.68.17.132 State: CRITICAL Date/Time: Thu 04 Jan 06:51:48 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [43200.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-fluorine02/Free space - all mounts is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: OK Date/Time: Thu 04 Jan 06:51:06 UTC 2018 Notes URLs: Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-ms-be04/Puppet staleness is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-ms-be04 Address: 10.68.16.139 State: CRITICAL Date/Time: Thu 04 Jan 06:49:24 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [43200.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-ms-be03/Puppet staleness is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-ms-be03 Address: 10.68.22.125 State: CRITICAL Date/Time: Thu 04 Jan 06:44:37 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [43200.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki06/App Server Main HTTP Response is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki06 Address: 10.68.19.241 State: OK Date/Time: Thu 04 Jan 06:37:04 UTC 2018 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 46419 bytes in 8.605 second response time

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki06/App Server Main HTTP Response is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki06 Address: 10.68.19.241 State: CRITICAL Date/Time: Thu 04 Jan 06:31:53 UTC 2018 Notes URLs: Additional Info: HTTP CRITICAL: HTTP/1.1 503 Service Unavailable - string 'Wikipedia' not found on

[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Thu 04 Jan 06:27:51 UTC 2018 Notes URLs: Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: CRITICAL Date/Time: Thu 04 Jan 06:20:41 UTC 2018 Notes URLs: Additional Info: CRITICAL: 60.00% of data above the critical threshold [10.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Thu 04 Jan 06:10:41 UTC 2018 Notes URLs: Additional Info: WARNING: 80.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Thu 04 Jan 06:06:49 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: CRITICAL Date/Time: Thu 04 Jan 06:05:42 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [10.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Thu 04 Jan 05:55:39 UTC 2018 Notes URLs: Additional Info: WARNING: 80.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka03/Puppet staleness is WARNING **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-kafka03 Address: 10.68.16.138 State: WARNING Date/Time: Thu 04 Jan 05:54:56 UTC 2018 Notes URLs: Additional Info: WARNING: 40.00% of data above the warning threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Thu 04 Jan 05:26:49 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-restbase02/Puppet staleness is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-restbase02 Address: 10.68.17.189 State: CRITICAL Date/Time: Thu 04 Jan 05:22:25 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [43200.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka03/Puppet staleness is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-kafka03 Address: 10.68.16.138 State: OK Date/Time: Thu 04 Jan 05:03:54 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-restbase01/Puppet staleness is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-restbase01 Address: 10.68.16.128 State: CRITICAL Date/Time: Thu 04 Jan 04:59:27 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [43200.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: CRITICAL Date/Time: Thu 04 Jan 04:50:42 UTC 2018 Notes URLs: Additional Info: CRITICAL: 80.00% of data above the critical threshold [10.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Thu 04 Jan 04:45:42 UTC 2018 Notes URLs: Additional Info: WARNING: 80.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Thu 04 Jan 04:05:48 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka03/Puppet staleness is WARNING **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-kafka03 Address: 10.68.16.138 State: WARNING Date/Time: Thu 04 Jan 03:58:55 UTC 2018 Notes URLs: Additional Info: WARNING: 66.67% of data above the warning threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-puppetmaster02/Long lived cherry-picks on puppetmaster is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Long lived cherry-picks on puppetmaster Host: deployment-puppetmaster02 Address: 10.68.21.200 State: CRITICAL Date/Time: Thu 04 Jan 02:30:44 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Thu 04 Jan 02:00:51 UTC 2018 Notes URLs: Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Thu 04 Jan 01:34:50 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka03/Puppet staleness is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-kafka03 Address: 10.68.16.138 State: OK Date/Time: Thu 04 Jan 01:32:55 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka03/Puppet staleness is WARNING **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-kafka03 Address: 10.68.16.138 State: WARNING Date/Time: Thu 04 Jan 00:27:55 UTC 2018 Notes URLs: Additional Info: WARNING: 55.56% of data above the warning threshold [3600.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-sca01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-sca01 Address: 10.68.20.183 State: OK Date/Time: Thu 04 Jan 00:20:29 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-mira/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mira Address: 10.68.20.135 State: OK Date/Time: Thu 04 Jan 00:19:59 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-sca02/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-sca02 Address: 10.68.20.153 State: OK Date/Time: Thu 04 Jan 00:16:08 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-changeprop/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-changeprop Address: 10.68.16.88 State: OK Date/Time: Thu 04 Jan 00:15:47 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Thu 04 Jan 00:14:41 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [1.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-cpjobqueue/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cpjobqueue Address: 10.68.22.161 State: OK Date/Time: Thu 04 Jan 00:12:54 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-mcs01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mcs01 Address: 10.68.17.18 State: OK Date/Time: Thu 04 Jan 00:05:57 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-sca03/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-sca03 Address: 10.68.21.183 State: OK Date/Time: Thu 04 Jan 00:06:04 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki05/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mediawiki05 Address: 10.68.22.21 State: OK Date/Time: Thu 04 Jan 00:04:33 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki04/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mediawiki04 Address: 10.68.19.128 State: OK Date/Time: Thu 04 Jan 00:03:29 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 03 Jan 23:59:40 UTC 2018 Notes URLs: Additional Info: WARNING: 80.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki06/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mediawiki06 Address: 10.68.19.241 State: OK Date/Time: Wed 03 Jan 23:58:27 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-tin/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-tin Address: 10.68.21.205 State: OK Date/Time: Wed 03 Jan 23:54:49 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mathoid Address: 10.68.23.236 State: OK Date/Time: Wed 03 Jan 23:53:05 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Wed 03 Jan 23:38:49 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka-jumbo-1/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-kafka-jumbo-1 Address: 10.68.23.243 State: CRITICAL Date/Time: Wed 03 Jan 23:37:24 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-netbox/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-netbox Address: 10.68.19.203 State: CRITICAL Date/Time: Wed 03 Jan 23:36:08 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka-jumbo-2/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-kafka-jumbo-2 Address: 10.68.16.87 State: CRITICAL Date/Time: Wed 03 Jan 23:30:56 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Wed 03 Jan 22:58:50 UTC 2018 Notes URLs: Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Wed 03 Jan 22:37:50 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Wed 03 Jan 21:57:51 UTC 2018 Notes URLs: Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-fluorine02/Free space - all mounts is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: CRITICAL Date/Time: Wed 03 Jan 21:21:06 UTC 2018 Notes URLs: Additional Info: CRITICAL: deployment-prep.deployment-fluorine02.diskspace._srv.byte_percentfree (<33.33%)

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Wed 03 Jan 21:06:51 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Wed 03 Jan 20:43:41 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-eventlogging04/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-eventlogging04 Address: 10.68.23.204 State: CRITICAL Date/Time: Wed 03 Jan 20:34:28 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Wed 03 Jan 20:01:50 UTC 2018 Notes URLs: Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: CRITICAL Date/Time: Wed 03 Jan 19:53:41 UTC 2018 Notes URLs: Additional Info: CRITICAL: 80.00% of data above the critical threshold [10.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-tin Address: 10.68.21.205 State: CRITICAL Date/Time: Wed 03 Jan 19:49:49 UTC 2018 Notes URLs: Additional Info: CRITICAL: 66.67% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-etcd-01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-etcd-01 Address: 10.68.19.227 State: OK Date/Time: Wed 03 Jan 19:46:19 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka01/Free space - all mounts is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-kafka01 Address: 10.68.21.219 State: OK Date/Time: Wed 03 Jan 19:42:31 UTC 2018 Notes URLs: Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-fluorine02/Free space - all mounts is WARNING **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: WARNING Date/Time: Wed 03 Jan 19:41:06 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-fluorine02.diskspace._srv.byte_percentfree (<66.67%)

[Betacluster-alerts] ** PROBLEM alert - deployment-cpjobqueue/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cpjobqueue Address: 10.68.22.161 State: CRITICAL Date/Time: Wed 03 Jan 19:37:54 UTC 2018 Notes URLs: Additional Info: CRITICAL: 40.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-sca02/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-sca02 Address: 10.68.20.153 State: CRITICAL Date/Time: Wed 03 Jan 19:36:11 UTC 2018 Notes URLs: Additional Info: CRITICAL: 44.44% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Wed 03 Jan 19:35:49 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-sca03/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-sca03 Address: 10.68.21.183 State: CRITICAL Date/Time: Wed 03 Jan 19:31:04 UTC 2018 Notes URLs: Additional Info: CRITICAL: 55.56% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-tin/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-tin Address: 10.68.21.205 State: OK Date/Time: Wed 03 Jan 19:23:50 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] beta-scap-eqiad - Build # 189195 - Still Failing!

2018-01-03 Thread jenkins-bot
beta-scap-eqiad - Build # 189195 - Still Failing: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/189195/ to view the results.___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mathoid Address: 10.68.23.236 State: CRITICAL Date/Time: Wed 03 Jan 19:18:03 UTC 2018 Notes URLs: Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 03 Jan 19:17:44 UTC 2018 Notes URLs: Additional Info: WARNING: 40.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Wed 03 Jan 19:11:40 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [1.0] ___

[Betacluster-alerts] beta-scap-eqiad - Build # 189194 - Failure!

2018-01-03 Thread jenkins-bot
beta-scap-eqiad - Build # 189194 - Failure: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/189194/ to view the results.___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** PROBLEM alert - deployment-changeprop/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-changeprop Address: 10.68.16.88 State: CRITICAL Date/Time: Wed 03 Jan 19:10:48 UTC 2018 Notes URLs: Additional Info: CRITICAL: 66.67% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-sca01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-sca01 Address: 10.68.20.183 State: CRITICAL Date/Time: Wed 03 Jan 19:10:30 UTC 2018 Notes URLs: Additional Info: CRITICAL: 22.22% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Wed 03 Jan 19:00:50 UTC 2018 Notes URLs: Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka03/Free space - all mounts is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-kafka03 Address: 10.68.16.138 State: CRITICAL Date/Time: Wed 03 Jan 18:54:29 UTC 2018 Notes URLs: Additional Info: CRITICAL: deployment-prep.deployment-kafka03.diskspace.root.byte_percentfree (<100.00%)

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 03 Jan 18:51:41 UTC 2018 Notes URLs: Additional Info: WARNING: 40.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-mx/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mx Address: 10.68.17.78 State: CRITICAL Date/Time: Wed 03 Jan 18:39:21 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-cache-text04/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cache-text04 Address: 10.68.18.103 State: CRITICAL Date/Time: Wed 03 Jan 18:34:04 UTC 2018 Notes URLs: Additional Info: CRITICAL: 44.44% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka03/Puppet staleness is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-kafka03 Address: 10.68.16.138 State: OK Date/Time: Wed 03 Jan 18:31:55 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [3600.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Wed 03 Jan 18:09:50 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Wed 03 Jan 17:29:51 UTC 2018 Notes URLs: Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka03/Puppet staleness is WARNING **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-kafka03 Address: 10.68.16.138 State: WARNING Date/Time: Wed 03 Jan 17:26:55 UTC 2018 Notes URLs: Additional Info: WARNING: 55.56% of data above the warning threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka03/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-kafka03 Address: 10.68.16.138 State: CRITICAL Date/Time: Wed 03 Jan 16:25:02 UTC 2018 Notes URLs: Additional Info: CRITICAL: 40.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Wed 03 Jan 15:38:51 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka03/Puppet staleness is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-kafka03 Address: 10.68.16.138 State: OK Date/Time: Wed 03 Jan 15:35:56 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-redis02/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-redis02 Address: 10.68.16.231 State: CRITICAL Date/Time: Wed 03 Jan 15:33:41 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Wed 03 Jan 14:28:51 UTC 2018 Notes URLs: Additional Info: CRITICAL: 40.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Wed 03 Jan 14:05:40 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 03 Jan 14:00:40 UTC 2018 Notes URLs: Additional Info: WARNING: 20.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Wed 03 Jan 13:37:51 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka01/Free space - all mounts is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-kafka01 Address: 10.68.21.219 State: CRITICAL Date/Time: Wed 03 Jan 12:57:31 UTC 2018 Notes URLs: Additional Info: CRITICAL: deployment-prep.deployment-kafka01.diskspace.root.byte_percentfree (<55.56%)

[Betacluster-alerts] ** RECOVERY alert - Generic Beta Cluster/English Wikipedia Mobile Main page is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: OK Date/Time: Wed 03 Jan 12:05:45 UTC 2018 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 35469 bytes in 4.628 second response time

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Mobile Main page is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Wed 03 Jan 12:00:41 UTC 2018 Notes URLs: Additional Info: HTTP CRITICAL: HTTP/1.1 503 Service Unavailable - string 'Wikipedia'

[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Wed 03 Jan 11:57:48 UTC 2018 Notes URLs: Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Wed 03 Jan 11:36:50 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-trending01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-trending01 Address: 10.68.18.186 State: CRITICAL Date/Time: Wed 03 Jan 11:27:33 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Wed 03 Jan 10:56:49 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-01-03 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Wed 03 Jan 10:35:50 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka01/Free space - all mounts is WARNING **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-kafka01 Address: 10.68.21.219 State: WARNING Date/Time: Wed 03 Jan 10:12:32 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-kafka01.diskspace.root.byte_percentfree (<100.00%)

[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-01-03 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Wed 03 Jan 08:30:52 UTC 2018 Notes URLs: Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0] ___