[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Fri 06 Oct 04:44:49 UTC 2017 Additional Info: WARNING: 80.00% of data above the warning threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-fluorine02/Free space - all mounts is CRITICAL **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: CRITICAL Date/Time: Fri 06 Oct 04:38:00 UTC 2017 Additional Info: CRITICAL: deployment-prep.deployment-fluorine02.diskspace._srv.byte_percentfree (<30.00%)

[Betacluster-alerts] ** RECOVERY alert - deployment-tin/Puppet errors is OK **

2017-10-05 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-tin Address: 10.68.21.205 State: OK Date/Time: Fri 06 Oct 00:30:25 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Puppet errors is CRITICAL **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-tin Address: 10.68.21.205 State: CRITICAL Date/Time: Fri 06 Oct 00:00:24 UTC 2017 Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - Generic Beta Cluster/English Wikipedia Mobile Main page is OK **

2017-10-05 Thread shinken
Notification Type: RECOVERY Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: OK Date/Time: Thu 05 Oct 18:13:34 UTC 2017 Additional Info: HTTP OK: HTTP/1.1 200 OK - 35332 bytes in 0.991 second response time

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Mobile Main page is CRITICAL **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Thu 05 Oct 18:08:43 UTC 2017 Additional Info: CRITICAL - Socket timeout after 10 seconds

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki04/Free space - all mounts is OK **

2017-10-05 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-mediawiki04 Address: 10.68.19.128 State: OK Date/Time: Thu 05 Oct 15:48:54 UTC 2017 Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki04/Free space - all mounts is WARNING **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-mediawiki04 Address: 10.68.19.128 State: WARNING Date/Time: Thu 05 Oct 15:38:54 UTC 2017 Additional Info: WARNING: deployment-prep.deployment-mediawiki04.diskspace.root.byte_percentfree (<10.00%)

[Betacluster-alerts] ** PROBLEM alert - deployment-cache-text04/Puppet errors is CRITICAL **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cache-text04 Address: 10.68.18.103 State: CRITICAL Date/Time: Thu 05 Oct 15:21:50 UTC 2017 Additional Info: CRITICAL: 40.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-cache-upload04/Puppet errors is CRITICAL **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cache-upload04 Address: 10.68.18.109 State: CRITICAL Date/Time: Thu 05 Oct 15:10:09 UTC 2017 Additional Info: CRITICAL: 44.44% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-imagescaler01/Puppet errors is OK **

2017-10-05 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-imagescaler01 Address: 10.68.19.158 State: OK Date/Time: Thu 05 Oct 14:39:37 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki04/Free space - all mounts is OK **

2017-10-05 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-mediawiki04 Address: 10.68.19.128 State: OK Date/Time: Thu 05 Oct 14:27:54 UTC 2017 Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki04/Free space - all mounts is WARNING **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-mediawiki04 Address: 10.68.19.128 State: WARNING Date/Time: Thu 05 Oct 14:17:53 UTC 2017 Additional Info: WARNING: deployment-prep.deployment-mediawiki04.diskspace.root.byte_percentfree (<10.00%)

[Betacluster-alerts] ** PROBLEM alert - deployment-imagescaler01/Puppet errors is CRITICAL **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-imagescaler01 Address: 10.68.19.158 State: CRITICAL Date/Time: Thu 05 Oct 13:59:38 UTC 2017 Additional Info: CRITICAL: 22.22% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki07/App Server Main HTTP Response is CRITICAL **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki07 Address: 10.68.17.40 State: CRITICAL Date/Time: Thu 05 Oct 11:37:57 UTC 2017 Additional Info: HTTP CRITICAL: HTTP/1.1 404 Not Found - string 'Wikipedia' not found on

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Thu 05 Oct 11:07:48 UTC 2017 Additional Info: WARNING: 20.00% of data above the warning threshold [1.0] ___

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2017-10-05 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Thu 05 Oct 10:51:49 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [1.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Thu 05 Oct 10:46:48 UTC 2017 Additional Info: WARNING: 20.00% of data above the warning threshold [1.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-cache-upload04/Puppet errors is OK **

2017-10-05 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cache-upload04 Address: 10.68.18.109 State: OK Date/Time: Thu 05 Oct 09:19:06 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka-jumbo-2/Puppet errors is OK **

2017-10-05 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-kafka-jumbo-2 Address: 10.68.16.87 State: OK Date/Time: Thu 05 Oct 09:11:56 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka-jumbo-1/Puppet errors is OK **

2017-10-05 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-kafka-jumbo-1 Address: 10.68.23.243 State: OK Date/Time: Thu 05 Oct 09:10:25 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** PROBLEM alert - deployment-trending01/Puppet errors is CRITICAL **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-trending01 Address: 10.68.18.186 State: CRITICAL Date/Time: Thu 05 Oct 09:02:01 UTC 2017 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-cache-text04/Puppet errors is OK **

2017-10-05 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cache-text04 Address: 10.68.18.103 State: OK Date/Time: Thu 05 Oct 09:00:49 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2017-10-05 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Thu 05 Oct 08:49:47 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [1.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Thu 05 Oct 07:44:48 UTC 2017 Additional Info: WARNING: 80.00% of data above the warning threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is CRITICAL **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: CRITICAL Date/Time: Thu 05 Oct 07:39:49 UTC 2017 Additional Info: CRITICAL: 20.00% of data above the critical threshold [10.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-fluorine02/Free space - all mounts is OK **

2017-10-05 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: OK Date/Time: Thu 05 Oct 07:06:59 UTC 2017 Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka01/Puppet errors is CRITICAL **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-kafka01 Address: 10.68.21.219 State: CRITICAL Date/Time: Thu 05 Oct 06:57:17 UTC 2017 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Thu 05 Oct 06:44:47 UTC 2017 Additional Info: WARNING: 80.00% of data above the warning threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is CRITICAL **

2017-10-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: CRITICAL Date/Time: Thu 05 Oct 06:39:48 UTC 2017 Additional Info: CRITICAL: 20.00% of data above the critical threshold [10.0]