[Betacluster-alerts] ** RECOVERY alert - deployment-sca03/Free space - all mounts is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-sca03 Address: 10.68.21.183 State: OK Date/Time: Thu 12 Oct 05:21:50 UTC 2017 Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-sca03/Free space - all mounts is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-sca03 Address: 10.68.21.183 State: CRITICAL Date/Time: Thu 12 Oct 05:11:49 UTC 2017 Additional Info: CRITICAL: deployment-prep.deployment-sca03.diskspace._srv.byte_percentfree (<10.00%)

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Thu 12 Oct 04:44:48 UTC 2017 Additional Info: WARNING: 80.00% of data above the warning threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-fluorine02/Free space - all mounts is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: CRITICAL Date/Time: Thu 12 Oct 03:48:59 UTC 2017 Additional Info: CRITICAL: deployment-prep.deployment-fluorine02.diskspace._srv.byte_percentfree (<50.00%)

[Betacluster-alerts] ** RECOVERY alert - deployment-mira/Puppet errors is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mira Address: 10.68.20.135 State: OK Date/Time: Thu 12 Oct 01:30:51 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-fluorine02/Free space - all mounts is WARNING **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: WARNING Date/Time: Thu 12 Oct 00:38:58 UTC 2017 Additional Info: WARNING: deployment-prep.deployment-fluorine02.diskspace._srv.byte_percentfree (<50.00%)

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Puppet errors is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-tin Address: 10.68.21.205 State: CRITICAL Date/Time: Thu 12 Oct 00:30:26 UTC 2017 Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-tin/Puppet errors is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-tin Address: 10.68.21.205 State: OK Date/Time: Thu 12 Oct 00:09:24 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Wed 11 Oct 23:27:48 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [1.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 11 Oct 23:22:49 UTC 2017 Additional Info: WARNING: 20.00% of data above the warning threshold [1.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-sca03/Free space - all mounts is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-sca03 Address: 10.68.21.183 State: OK Date/Time: Wed 11 Oct 22:30:50 UTC 2017 Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-sca03/Free space - all mounts is WARNING **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-sca03 Address: 10.68.21.183 State: WARNING Date/Time: Wed 11 Oct 22:20:48 UTC 2017 Additional Info: WARNING: deployment-prep.deployment-sca03.diskspace._srv.byte_percentfree (<33.33%)

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 11 Oct 20:46:49 UTC 2017 Additional Info: WARNING: 20.00% of data above the warning threshold [1.0] ___

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Wed 11 Oct 20:25:47 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [1.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 11 Oct 20:15:47 UTC 2017 Additional Info: WARNING: 20.00% of data above the warning threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: CRITICAL Date/Time: Wed 11 Oct 20:10:48 UTC 2017 Additional Info: CRITICAL: 20.00% of data above the critical threshold [10.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 11 Oct 20:05:49 UTC 2017 Additional Info: WARNING: 20.00% of data above the warning threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Puppet errors is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-tin Address: 10.68.21.205 State: CRITICAL Date/Time: Wed 11 Oct 19:59:24 UTC 2017 Additional Info: CRITICAL: 22.22% of data above the critical threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-cassandra3-02/Puppet staleness is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-cassandra3-02 Address: 10.68.21.237 State: OK Date/Time: Wed 11 Oct 19:32:01 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [3600.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - deployment-cassandra3-02/Puppet errors is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cassandra3-02 Address: 10.68.21.237 State: OK Date/Time: Wed 11 Oct 19:29:56 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** RECOVERY alert - deployment-cassandra3-01/Puppet errors is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cassandra3-01 Address: 10.68.17.103 State: OK Date/Time: Wed 11 Oct 19:10:09 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** RECOVERY alert - deployment-tin/Puppet errors is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-tin Address: 10.68.21.205 State: OK Date/Time: Wed 11 Oct 19:08:24 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-ms-be04/Puppet errors is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-ms-be04 Address: 10.68.16.139 State: OK Date/Time: Wed 11 Oct 18:59:55 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-mx/Puppet errors is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mx Address: 10.68.17.78 State: CRITICAL Date/Time: Wed 11 Oct 18:39:21 UTC 2017 Additional Info: CRITICAL: 22.22% of data above the critical threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Puppet errors is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-tin Address: 10.68.21.205 State: CRITICAL Date/Time: Wed 11 Oct 18:33:26 UTC 2017 Additional Info: CRITICAL: 66.67% of data above the critical threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-mira/Puppet errors is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mira Address: 10.68.20.135 State: CRITICAL Date/Time: Wed 11 Oct 18:25:51 UTC 2017 Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-ms-be04/Puppet errors is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-ms-be04 Address: 10.68.16.139 State: CRITICAL Date/Time: Wed 11 Oct 18:24:53 UTC 2017 Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] beta-scap-eqiad - Build # 177074 - Fixed!

2017-10-11 Thread jenkins-bot
beta-scap-eqiad - Build # 177074 - Fixed: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/177074/ to view the results.___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] beta-scap-eqiad - Build # 177073 - Failure!

2017-10-11 Thread jenkins-bot
beta-scap-eqiad - Build # 177073 - Failure: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/177073/ to view the results.___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] beta-scap-eqiad - Build # 177071 - Fixed!

2017-10-11 Thread jenkins-bot
beta-scap-eqiad - Build # 177071 - Fixed: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/177071/ to view the results.___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] beta-scap-eqiad - Build # 177070 - Failure!

2017-10-11 Thread jenkins-bot
beta-scap-eqiad - Build # 177070 - Failure: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/177070/ to view the results.___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki05/Free space - all mounts is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-mediawiki05 Address: 10.68.22.21 State: OK Date/Time: Wed 11 Oct 15:56:08 UTC 2017 Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki05/Free space - all mounts is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-mediawiki05 Address: 10.68.22.21 State: CRITICAL Date/Time: Wed 11 Oct 15:51:09 UTC 2017 Additional Info: CRITICAL: deployment-prep.deployment-mediawiki05.diskspace.root.byte_percentfree (<11.11%)

[Betacluster-alerts] Host DOWN alert for deployment-pdf01!

2017-10-11 Thread shinken
Notification Type: PROBLEM Host: deployment-pdf01 State: DOWN Address: 10.68.16.73 Info: CRITICAL - Host Unreachable (10.68.16.73) Date/Time: Wed 11 Oct 15:34:31 UTC 2017 ___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka01/Free space - all mounts is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-kafka01 Address: 10.68.21.219 State: CRITICAL Date/Time: Wed 11 Oct 15:34:03 UTC 2017 Additional Info: CRITICAL: deployment-prep.deployment-kafka01.diskspace.root.byte_percentfree (<100.00%)

[Betacluster-alerts] ** PROBLEM alert - deployment-cache-text04/Puppet errors is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cache-text04 Address: 10.68.18.103 State: CRITICAL Date/Time: Wed 11 Oct 15:21:50 UTC 2017 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-cache-upload04/Puppet errors is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cache-upload04 Address: 10.68.18.109 State: CRITICAL Date/Time: Wed 11 Oct 15:10:09 UTC 2017 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-cassandra3-02/Puppet staleness is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-cassandra3-02 Address: 10.68.21.237 State: OK Date/Time: Wed 11 Oct 13:51:01 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [3600.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki07/App Server Main HTTP Response is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki07 Address: 10.68.17.40 State: CRITICAL Date/Time: Wed 11 Oct 11:37:57 UTC 2017 Additional Info: HTTP CRITICAL: HTTP/1.1 404 Not Found - string 'Wikipedia' not found on

[Betacluster-alerts] ** PROBLEM alert - deployment-cassandra3-02/Puppet staleness is WARNING **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-cassandra3-02 Address: 10.68.21.237 State: WARNING Date/Time: Wed 11 Oct 10:01:02 UTC 2017 Additional Info: WARNING: 50.00% of data above the warning threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-cassandra3-01/Puppet staleness is WARNING **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-cassandra3-01 Address: 10.68.17.103 State: WARNING Date/Time: Wed 11 Oct 09:56:34 UTC 2017 Additional Info: WARNING: 22.22% of data above the warning threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-cassandra3-01/Puppet errors is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cassandra3-01 Address: 10.68.17.103 State: CRITICAL Date/Time: Wed 11 Oct 09:05:08 UTC 2017 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-trending01/Puppet errors is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-trending01 Address: 10.68.18.186 State: CRITICAL Date/Time: Wed 11 Oct 09:02:10 UTC 2017 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Wed 11 Oct 08:49:49 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [1.0] ___ Betacluster-alerts

[Betacluster-alerts] Host DOWN alert for deployent-cassandra3-01!

2017-10-11 Thread shinken
Notification Type: PROBLEM Host: deployent-cassandra3-01 State: DOWN Address: 10.68.19.232 Info: CRITICAL - Host Unreachable (10.68.19.232) Date/Time: Wed 11 Oct 08:41:54 UTC 2017 ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-cassandra3-02/Puppet errors is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cassandra3-02 Address: 10.68.21.237 State: CRITICAL Date/Time: Wed 11 Oct 08:09:55 UTC 2017 Additional Info: CRITICAL: 88.89% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployent-cassandra3-01/Puppet errors is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployent-cassandra3-01 Address: 10.68.19.232 State: CRITICAL Date/Time: Wed 11 Oct 08:07:18 UTC 2017 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-fluorine02/Free space - all mounts is OK **

2017-10-11 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: OK Date/Time: Wed 11 Oct 07:13:02 UTC 2017 Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka01/Puppet errors is CRITICAL **

2017-10-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-kafka01 Address: 10.68.21.219 State: CRITICAL Date/Time: Wed 11 Oct 06:57:17 UTC 2017 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___