[Betacluster-alerts] ** PROBLEM alert - deployment-poolcounter04/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-poolcounter04 Address: 10.68.17.48 State: CRITICAL Date/Time: Sun 03 Jun 05:59:31 UTC 2018 Notes URLs: Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-restbase01/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-restbase01 Address: 10.68.16.128 State: CRITICAL Date/Time: Sun 03 Jun 05:58:12 UTC 2018 Notes URLs: Additional Info: CRITICAL: 44.44% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-snapshot01/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-snapshot01 Address: 10.68.19.94 State: CRITICAL Date/Time: Sun 03 Jun 05:57:04 UTC 2018 Notes URLs: Additional Info: CRITICAL: 22.22% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-conf03/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-conf03 Address: 10.68.20.134 State: CRITICAL Date/Time: Sun 03 Jun 05:57:04 UTC 2018 Notes URLs: Additional Info: CRITICAL: 22.22% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-apertium02/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-apertium02 Address: 10.68.22.254 State: CRITICAL Date/Time: Sun 03 Jun 05:56:38 UTC 2018 Notes URLs: Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-jobrunner03/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-jobrunner03 Address: 10.68.22.109 State: CRITICAL Date/Time: Sun 03 Jun 05:55:36 UTC 2018 Notes URLs: Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-puppetdb02/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-puppetdb02 Address: 10.68.19.126 State: CRITICAL Date/Time: Sun 03 Jun 05:55:03 UTC 2018 Notes URLs: Additional Info: CRITICAL: 22.22% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-ms-be03/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-ms-be03 Address: 10.68.22.125 State: CRITICAL Date/Time: Sun 03 Jun 05:53:03 UTC 2018 Notes URLs: Additional Info: CRITICAL: 66.67% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-sca04/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-sca04 Address: 10.68.18.80 State: CRITICAL Date/Time: Sun 03 Jun 05:52:32 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-ircd/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-ircd Address: 10.68.20.19 State: CRITICAL Date/Time: Sun 03 Jun 05:53:21 UTC 2018 Notes URLs: Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki-09/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mediawiki-09 Address: 10.68.17.159 State: CRITICAL Date/Time: Sun 03 Jun 05:53:16 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-restbase02/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-restbase02 Address: 10.68.17.189 State: CRITICAL Date/Time: Sun 03 Jun 05:51:36 UTC 2018 Notes URLs: Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-puppetmaster03/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-puppetmaster03 Address: 10.68.23.29 State: CRITICAL Date/Time: Sun 03 Jun 05:52:20 UTC 2018 Notes URLs: Additional Info: CRITICAL: 55.56% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-memc05/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-memc05 Address: 10.68.23.49 State: CRITICAL Date/Time: Sun 03 Jun 05:50:58 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-webperf11/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-webperf11 Address: 10.68.19.168 State: OK Date/Time: Sun 03 Jun 05:49:42 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-puppetdb02/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-puppetdb02 Address: 10.68.19.126 State: OK Date/Time: Sun 03 Jun 05:49:04 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-puppetdb02/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-puppetdb02 Address: 10.68.19.126 State: CRITICAL Date/Time: Sun 03 Jun 05:39:05 UTC 2018 Notes URLs: Additional Info: CRITICAL: 22.22% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-webperf11/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-webperf11 Address: 10.68.19.168 State: CRITICAL Date/Time: Sun 03 Jun 05:09:42 UTC 2018 Notes URLs: Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-snapshot01/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-snapshot01 Address: 10.68.19.94 State: OK Date/Time: Sun 03 Jun 04:26:05 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-dumps-puppetmaster/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-dumps-puppetmaster Address: 10.68.21.153 State: OK Date/Time: Sun 03 Jun 03:34:25 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-snapshot01/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-snapshot01 Address: 10.68.19.94 State: OK Date/Time: Sun 03 Jun 03:25:05 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-dumps-puppetmaster/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-dumps-puppetmaster Address: 10.68.21.153 State: CRITICAL Date/Time: Sun 03 Jun 03:24:26 UTC 2018 Notes URLs: Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-fluorine02/Free space - all mounts is WARNING **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-fluorine02 Address: 10.68.23.106 State: WARNING Date/Time: Sun 03 Jun 03:22:34 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-fluorine02.diskspace._srv.byte_percentfree (<60.00%)

[Betacluster-alerts] ** RECOVERY alert - deployment-restbase02/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-restbase02 Address: 10.68.17.189 State: OK Date/Time: Sun 03 Jun 03:00:36 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-chromium01/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-chromium01 Address: 10.68.19.44 State: OK Date/Time: Sun 03 Jun 02:57:14 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-cassandra3-02/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cassandra3-02 Address: 10.68.21.237 State: OK Date/Time: Sun 03 Jun 02:56:14 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mathoid Address: 10.68.23.236 State: OK Date/Time: Sun 03 Jun 02:54:09 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-imagescaler01/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-imagescaler01 Address: 10.68.19.158 State: OK Date/Time: Sun 03 Jun 02:54:19 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-aqs02/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-aqs02 Address: 10.68.17.90 State: OK Date/Time: Sun 03 Jun 02:52:12 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-imagescaler02/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-imagescaler02 Address: 10.68.18.233 State: OK Date/Time: Sun 03 Jun 02:51:47 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-webperf11/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-webperf11 Address: 10.68.19.168 State: OK Date/Time: Sun 03 Jun 02:48:43 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-sca01/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-sca01 Address: 10.68.20.183 State: OK Date/Time: Sun 03 Jun 02:48:03 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-aqs01/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-aqs01 Address: 10.68.18.237 State: OK Date/Time: Sun 03 Jun 02:47:24 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-eventlog05/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-eventlog05 Address: 10.68.18.180 State: OK Date/Time: Sun 03 Jun 02:45:59 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-changeprop/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-changeprop Address: 10.68.16.88 State: OK Date/Time: Sun 03 Jun 02:45:37 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka-main-1/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-kafka-main-1 Address: 10.68.18.219 State: OK Date/Time: Sun 03 Jun 02:44:59 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-cpjobqueue/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cpjobqueue Address: 10.68.22.161 State: OK Date/Time: Sun 03 Jun 02:45:24 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-cassandra3-01/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cassandra3-01 Address: 10.68.17.103 State: OK Date/Time: Sun 03 Jun 02:45:08 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-sca02/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-sca02 Address: 10.68.20.153 State: OK Date/Time: Sun 03 Jun 02:44:53 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-parsoid09/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-parsoid09 Address: 10.68.20.142 State: OK Date/Time: Sun 03 Jun 02:43:19 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-aqs03/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-aqs03 Address: 10.68.17.125 State: OK Date/Time: Sun 03 Jun 02:41:47 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka-main-2/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-kafka-main-2 Address: 10.68.23.182 State: OK Date/Time: Sun 03 Jun 02:41:11 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-zotero01/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: OK Date/Time: Sun 03 Jun 02:39:25 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-ores01/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-ores01 Address: 10.68.16.235 State: OK Date/Time: Sun 03 Jun 02:39:52 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-logstash2/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-logstash2 Address: 10.68.16.147 State: OK Date/Time: Sun 03 Jun 02:39:23 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-mcs01/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mcs01 Address: 10.68.17.18 State: OK Date/Time: Sun 03 Jun 02:38:30 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-restbase01/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-restbase01 Address: 10.68.16.128 State: OK Date/Time: Sun 03 Jun 02:37:14 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-cumin/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cumin Address: 10.68.21.105 State: OK Date/Time: Sun 03 Jun 02:35:46 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-pdfrender02/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-pdfrender02 Address: 10.68.21.240 State: OK Date/Time: Sun 03 Jun 02:34:22 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] beta-scap-eqiad - Build # 210256 - Fixed!

2018-06-02 Thread jenkins-bot
beta-scap-eqiad - Build # 210256 - Fixed: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/210256/ to view the results.___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki06/App Server Main HTTP Response is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki06 Address: 10.68.19.241 State: OK Date/Time: Sun 03 Jun 02:21:27 UTC 2018 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 47536 bytes in 1.138 second response time

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki-09/App Server Main HTTP Response is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki-09 Address: 10.68.17.159 State: OK Date/Time: Sun 03 Jun 02:21:00 UTC 2018 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 46942 bytes in 0.901 second response time

[Betacluster-alerts] beta-scap-eqiad - Build # 210255 - Still Failing!

2018-06-02 Thread jenkins-bot
beta-scap-eqiad - Build # 210255 - Still Failing: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/210255/ to view the results.

[Betacluster-alerts] ** PROBLEM alert - deployment-aqs02/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-aqs02 Address: 10.68.17.90 State: CRITICAL Date/Time: Sun 03 Jun 02:17:11 UTC 2018 Notes URLs: Additional Info: CRITICAL: 44.44% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-imagescaler02/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-imagescaler02 Address: 10.68.18.233 State: CRITICAL Date/Time: Sun 03 Jun 02:16:46 UTC 2018 Notes URLs: Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-cassandra3-02/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cassandra3-02 Address: 10.68.21.237 State: CRITICAL Date/Time: Sun 03 Jun 02:16:12 UTC 2018 Notes URLs: Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-imagescaler01/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-imagescaler01 Address: 10.68.19.158 State: CRITICAL Date/Time: Sun 03 Jun 02:14:18 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-webperf11/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-webperf11 Address: 10.68.19.168 State: CRITICAL Date/Time: Sun 03 Jun 02:13:41 UTC 2018 Notes URLs: Additional Info: CRITICAL: 55.56% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-sca01/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-sca01 Address: 10.68.20.183 State: CRITICAL Date/Time: Sun 03 Jun 02:13:03 UTC 2018 Notes URLs: Additional Info: CRITICAL: 44.44% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-aqs01/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-aqs01 Address: 10.68.18.237 State: CRITICAL Date/Time: Sun 03 Jun 02:12:24 UTC 2018 Notes URLs: Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-eventlog05/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-eventlog05 Address: 10.68.18.180 State: CRITICAL Date/Time: Sun 03 Jun 02:10:58 UTC 2018 Notes URLs: Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-cpjobqueue/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cpjobqueue Address: 10.68.22.161 State: CRITICAL Date/Time: Sun 03 Jun 02:10:24 UTC 2018 Notes URLs: Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-cassandra3-01/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cassandra3-01 Address: 10.68.17.103 State: CRITICAL Date/Time: Sun 03 Jun 02:10:10 UTC 2018 Notes URLs: Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-changeprop/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-changeprop Address: 10.68.16.88 State: CRITICAL Date/Time: Sun 03 Jun 02:10:38 UTC 2018 Notes URLs: Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0]

[Betacluster-alerts] beta-scap-eqiad - Build # 210254 - Still Failing!

2018-06-02 Thread jenkins-bot
beta-scap-eqiad - Build # 210254 - Still Failing: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/210254/ to view the results.

[Betacluster-alerts] ** PROBLEM alert - deployment-sca02/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-sca02 Address: 10.68.20.153 State: CRITICAL Date/Time: Sun 03 Jun 02:09:52 UTC 2018 Notes URLs: Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki-07/App Server Main HTTP Response is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki-07 Address: 10.68.18.62 State: OK Date/Time: Sun 03 Jun 02:09:05 UTC 2018 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 46950 bytes in 1.866 second response time

[Betacluster-alerts] ** RECOVERY alert - Generic Beta Cluster/English Wikipedia Mobile Main page is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: OK Date/Time: Sun 03 Jun 02:09:32 UTC 2018 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 35876 bytes in 1.687 second response time

[Betacluster-alerts] ** PROBLEM alert - deployment-parsoid09/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-parsoid09 Address: 10.68.20.142 State: CRITICAL Date/Time: Sun 03 Jun 02:08:22 UTC 2018 Notes URLs: Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-aqs03/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-aqs03 Address: 10.68.17.125 State: CRITICAL Date/Time: Sun 03 Jun 02:06:47 UTC 2018 Notes URLs: Additional Info: CRITICAL: 40.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-logstash2/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-logstash2 Address: 10.68.16.147 State: CRITICAL Date/Time: Sun 03 Jun 02:04:23 UTC 2018 Notes URLs: Additional Info: CRITICAL: 44.44% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka-main-1/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-kafka-main-1 Address: 10.68.18.219 State: CRITICAL Date/Time: Sun 03 Jun 02:04:59 UTC 2018 Notes URLs: Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-restbase01/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-restbase01 Address: 10.68.16.128 State: CRITICAL Date/Time: Sun 03 Jun 02:02:12 UTC 2018 Notes URLs: Additional Info: CRITICAL: 66.67% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka-main-2/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-kafka-main-2 Address: 10.68.23.182 State: CRITICAL Date/Time: Sun 03 Jun 02:01:11 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-ores01/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-ores01 Address: 10.68.16.235 State: CRITICAL Date/Time: Sun 03 Jun 01:59:53 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-zotero01/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zotero01 Address: 10.68.17.102 State: CRITICAL Date/Time: Sun 03 Jun 01:59:25 UTC 2018 Notes URLs: Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0]

[Betacluster-alerts] beta-scap-eqiad - Build # 210253 - Still Failing!

2018-06-02 Thread jenkins-bot
beta-scap-eqiad - Build # 210253 - Still Failing: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/210253/ to view the results.

[Betacluster-alerts] ** PROBLEM alert - deployment-pdfrender02/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-pdfrender02 Address: 10.68.21.240 State: CRITICAL Date/Time: Sun 03 Jun 01:59:23 UTC 2018 Notes URLs: Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-mcs01/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mcs01 Address: 10.68.17.18 State: CRITICAL Date/Time: Sun 03 Jun 01:58:30 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-restbase02/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-restbase02 Address: 10.68.17.189 State: CRITICAL Date/Time: Sun 03 Jun 01:55:36 UTC 2018 Notes URLs: Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Mobile Main page is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Sun 03 Jun 01:54:40 UTC 2018 Notes URLs: Additional Info: CRITICAL - Socket timeout after 10 seconds

[Betacluster-alerts] ** RECOVERY alert - Generic Beta Cluster/English Wikipedia Main page is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: English Wikipedia Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: OK Date/Time: Sun 03 Jun 01:54:18 UTC 2018 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 47487 bytes in 3.537 second response time

[Betacluster-alerts] ** PROBLEM alert - deployment-chromium01/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-chromium01 Address: 10.68.19.44 State: CRITICAL Date/Time: Sun 03 Jun 01:52:16 UTC 2018 Notes URLs: Additional Info: CRITICAL: 55.56% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Main page is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Sun 03 Jun 01:49:27 UTC 2018 Notes URLs: Additional Info: CRITICAL - Socket timeout after 10 seconds

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki-07/App Server Main HTTP Response is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki-07 Address: 10.68.18.62 State: CRITICAL Date/Time: Sun 03 Jun 01:49:15 UTC 2018 Notes URLs: Additional Info: CRITICAL - Socket timeout after 10 seconds

[Betacluster-alerts] beta-scap-eqiad - Build # 210252 - Failure!

2018-06-02 Thread jenkins-bot
beta-scap-eqiad - Build # 210252 - Failure: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/210252/ to view the results.

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mathoid Address: 10.68.23.236 State: CRITICAL Date/Time: Sun 03 Jun 01:49:09 UTC 2018 Notes URLs: Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki-09/App Server Main HTTP Response is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki-09 Address: 10.68.17.159 State: CRITICAL Date/Time: Sun 03 Jun 01:46:09 UTC 2018 Notes URLs: Additional Info: CRITICAL - Socket timeout after 10 seconds

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki06/App Server Main HTTP Response is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki06 Address: 10.68.19.241 State: CRITICAL Date/Time: Sun 03 Jun 01:46:36 UTC 2018 Notes URLs: Additional Info: CRITICAL - Socket timeout after 10 seconds

[Betacluster-alerts] ** PROBLEM alert - deployment-cumin/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cumin Address: 10.68.21.105 State: CRITICAL Date/Time: Sun 03 Jun 00:05:45 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0]

[Betacluster-alerts] Host DOWN alert for deployment-puppetmaster02!

2018-06-02 Thread shinken
Notification Type: PROBLEM Host: deployment-puppetmaster02 State: DOWN Address: 10.68.21.200 Info: CRITICAL - Host Unreachable (10.68.21.200) Date/Time: Sat 02 Jun 22:30:27 UTC 2018

[Betacluster-alerts] ** RECOVERY alert - deployment-sca01/Free space - all mounts is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-sca01 Address: 10.68.20.183 State: OK Date/Time: Sat 02 Jun 22:28:47 UTC 2018 Notes URLs: Additional Info: OK: deployment-prep.deployment-sca01.diskspace._var.byte_percentfree (No valid datapoints found)

[Betacluster-alerts] ** PROBLEM alert - deployment-mx/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mx Address: 10.68.17.78 State: CRITICAL Date/Time: Sat 02 Jun 18:39:21 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Sat 02 Jun 18:19:15 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Sat 02 Jun 18:14:15 UTC 2018 Notes URLs: Additional Info: WARNING: 20.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-deploy-01/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-deploy-01 Address: 10.68.22.177 State: CRITICAL Date/Time: Sat 02 Jun 18:12:41 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-certcentral-testclient02/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-certcentral-testclient02 Address: 10.68.16.95 State: OK Date/Time: Sat 02 Jun 17:37:08 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-certcentral-testclient02/Puppet errors is CRITICAL **

2018-06-02 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-certcentral-testclient02 Address: 10.68.16.95 State: CRITICAL Date/Time: Sat 02 Jun 17:27:09 UTC 2018 Notes URLs: Additional Info: CRITICAL: 55.56% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-certcentral-testclient02/Puppet errors is OK **

2018-06-02 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-certcentral-testclient02 Address: 10.68.16.95 State: OK Date/Time: Sat 02 Jun 17:21:07 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] Host DOWN alert for deployment-certcentral!

2018-06-02 Thread shinken
Notification Type: PROBLEM Host: deployment-certcentral State: DOWN Address: 10.68.18.193 Info: CRITICAL - Host Unreachable (10.68.18.193) Date/Time: Sat 02 Jun 17:17:42 UTC 2018
