[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Mathoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: OK Date/Time: Thu 02 May 05:44:03 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 925 bytes in 0.024 second response time ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Mathoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: CRITICAL Date/Time: Thu 02 May 05:19:03 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.73 and port 10042: Connection refused ___

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Mathoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: OK Date/Time: Thu 02 May 05:13:04 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 925 bytes in 0.025 second response time ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Mathoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: CRITICAL Date/Time: Thu 02 May 04:53:02 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.73 and port 10042: Connection refused ___

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Mathoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: OK Date/Time: Thu 02 May 04:27:02 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 925 bytes in 0.025 second response time ___

[Betacluster-alerts] ** RECOVERY alert - deployment-sca02/Citoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Citoid Host: deployment-sca02 Address: 172.16.5.112 State: OK Date/Time: Thu 02 May 04:15:09 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 921 bytes in 0.026 second response time ___

[Betacluster-alerts] ** PROBLEM alert - deployment-sca02/Citoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Citoid Host: deployment-sca02 Address: 172.16.5.112 State: CRITICAL Date/Time: Thu 02 May 04:10:08 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.112 and port 1970: Connection refused ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Mathoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: CRITICAL Date/Time: Thu 02 May 03:32:03 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.73 and port 10042: Connection refused ___

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Mathoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: OK Date/Time: Thu 02 May 03:16:02 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 925 bytes in 0.023 second response time ___

[Betacluster-alerts] ** PROBLEM alert - deployment-fluorine02/Free space - all mounts is WARNING **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-fluorine02 Address: 172.16.5.71 State: WARNING Date/Time: Thu 02 May 03:10:51 UTC 2019 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-fluorine02.diskspace._srv.byte_percentfree (<50.00%)

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Mathoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: CRITICAL Date/Time: Thu 02 May 02:51:03 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.73 and port 10042: Connection refused ___

[Betacluster-alerts] ** PROBLEM alert - deployment-logstash2/Puppet staleness is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-logstash2 Address: 172.16.5.22 State: CRITICAL Date/Time: Thu 02 May 01:03:43 UTC 2019 Notes URLs: Additional Info: CRITICAL: 40.00% of data above the critical threshold [43200.0]

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Thu 02 May 00:04:22 UTC 2019 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 01 May 23:59:22 UTC 2019 Notes URLs: Additional Info: WARNING: 20.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-mwmaint01/Free space - all mounts is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-mwmaint01 Address: 172.16.4.16 State: OK Date/Time: Wed 01 May 20:45:00 UTC 2019 Notes URLs: Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-snapshot01/Free space - all mounts is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-snapshot01 Address: 172.16.4.132 State: OK Date/Time: Wed 01 May 20:43:51 UTC 2019 Notes URLs: Additional Info: OK: deployment-prep.deployment-snapshot01.diskspace._data.byte_percentfree (No valid datapoints

[Betacluster-alerts] ** PROBLEM alert - deployment-mwmaint01/Free space - all mounts is WARNING **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-mwmaint01 Address: 172.16.4.16 State: WARNING Date/Time: Wed 01 May 20:34:59 UTC 2019 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-mwmaint01.diskspace.root.byte_percentfree (<10.00%)

[Betacluster-alerts] ** PROBLEM alert - deployment-snapshot01/Free space - all mounts is WARNING **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-snapshot01 Address: 172.16.4.132 State: WARNING Date/Time: Wed 01 May 20:33:48 UTC 2019 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-snapshot01.diskspace._data.byte_percentfree (No valid

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Mathoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: OK Date/Time: Wed 01 May 18:55:04 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 925 bytes in 0.025 second response time ___

[Betacluster-alerts] ** RECOVERY alert - deployment-sca01/Content Translation Server is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Content Translation Server Host: deployment-sca01 Address: 172.16.5.13 State: OK Date/Time: Wed 01 May 18:48:48 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 904 bytes in 0.026 second response time

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Mathoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: CRITICAL Date/Time: Wed 01 May 18:45:04 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.73 and port 10042: Connection refused ___

[Betacluster-alerts] ** PROBLEM alert - deployment-sca01/Content Translation Server is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Content Translation Server Host: deployment-sca01 Address: 172.16.5.13 State: CRITICAL Date/Time: Wed 01 May 18:43:48 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.13 and port 8080: Connection refused

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Mathoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: OK Date/Time: Wed 01 May 18:39:05 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 925 bytes in 0.035 second response time ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Mathoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: CRITICAL Date/Time: Wed 01 May 18:14:03 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.73 and port 10042: Connection refused ___

[Betacluster-alerts] Host DOWN alert for deployment-poolcounter04!

2019-05-01 Thread Shinken
Notification Type: PROBLEM Host: deployment-poolcounter04 State: DOWN Address: 172.16.5.58 Info: CRITICAL - Host Unreachable (172.16.5.58) Date/Time: Wed 01 May 16:40:44 UTC 2019 ___ Betacluster-alerts mailing list

[Betacluster-alerts] Host DOWN alert for deployment-ms-fe02!

2019-05-01 Thread Shinken
Notification Type: PROBLEM Host: deployment-ms-fe02 State: DOWN Address: 172.16.5.66 Info: CRITICAL - Host Unreachable (172.16.5.66) Date/Time: Wed 01 May 16:40:26 UTC 2019 ___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Mathoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: OK Date/Time: Wed 01 May 16:15:02 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 925 bytes in 0.033 second response time ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Mathoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: CRITICAL Date/Time: Wed 01 May 16:10:03 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.73 and port 10042: Connection refused ___

[Betacluster-alerts] ** RECOVERY alert - deployment-sca02/Citoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Citoid Host: deployment-sca02 Address: 172.16.5.112 State: OK Date/Time: Wed 01 May 16:05:06 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 921 bytes in 0.024 second response time ___

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Mathoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: OK Date/Time: Wed 01 May 16:04:03 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 925 bytes in 0.041 second response time ___

[Betacluster-alerts] ** PROBLEM alert - deployment-sca02/Citoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Citoid Host: deployment-sca02 Address: 172.16.5.112 State: CRITICAL Date/Time: Wed 01 May 16:00:06 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.112 and port 1970: Connection refused ___

[Betacluster-alerts] ** RECOVERY alert - deployment-sca02/Citoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Citoid Host: deployment-sca02 Address: 172.16.5.112 State: OK Date/Time: Wed 01 May 15:54:09 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 921 bytes in 0.025 second response time ___

[Betacluster-alerts] ** PROBLEM alert - deployment-sca02/Citoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Citoid Host: deployment-sca02 Address: 172.16.5.112 State: CRITICAL Date/Time: Wed 01 May 15:24:08 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.112 and port 1970: Connection refused ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Mathoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: CRITICAL Date/Time: Wed 01 May 15:19:03 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.73 and port 10042: Connection refused ___

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Mathoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: OK Date/Time: Wed 01 May 15:08:04 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 925 bytes in 0.025 second response time ___

[Betacluster-alerts] Host DOWN alert for deployment-ms-be04!

2019-05-01 Thread Shinken
Notification Type: PROBLEM Host: deployment-ms-be04 State: DOWN Address: 172.16.4.129 Info: CRITICAL - Host Unreachable (172.16.4.129) Date/Time: Wed 01 May 14:45:18 UTC 2019 ___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] Host DOWN alert for deployment-ms-be03!

2019-05-01 Thread Shinken
Notification Type: PROBLEM Host: deployment-ms-be03 State: DOWN Address: 172.16.5.51 Info: CRITICAL - Host Unreachable (172.16.5.51) Date/Time: Wed 01 May 14:44:26 UTC 2019 ___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Wed 01 May 14:08:22 UTC 2019 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Mathoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: CRITICAL Date/Time: Wed 01 May 14:08:04 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.73 and port 10042: Connection refused ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 01 May 14:03:21 UTC 2019 Notes URLs: Additional Info: WARNING: 20.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-logstash2/Puppet staleness is WARNING **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-logstash2 Address: 172.16.5.22 State: WARNING Date/Time: Wed 01 May 14:03:41 UTC 2019 Notes URLs: Additional Info: WARNING: 40.00% of data above the warning threshold [3600.0]

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Wed 01 May 13:22:21 UTC 2019 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 01 May 13:17:20 UTC 2019 Notes URLs: Additional Info: WARNING: 20.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-mathoid/Mathoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: OK Date/Time: Wed 01 May 10:30:03 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 925 bytes in 0.026 second response time ___

[Betacluster-alerts] ** PROBLEM alert - deployment-mathoid/Mathoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Mathoid Host: deployment-mathoid Address: 172.16.5.73 State: CRITICAL Date/Time: Wed 01 May 10:25:03 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.73 and port 10042: Connection refused ___

[Betacluster-alerts] ** RECOVERY alert - deployment-sca02/Citoid is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Citoid Host: deployment-sca02 Address: 172.16.5.112 State: OK Date/Time: Wed 01 May 09:28:06 UTC 2019 Notes URLs: Additional Info: HTTP OK: HTTP/1.1 200 OK - 921 bytes in 0.028 second response time ___

[Betacluster-alerts] ** PROBLEM alert - deployment-sca02/Citoid is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Citoid Host: deployment-sca02 Address: 172.16.5.112 State: CRITICAL Date/Time: Wed 01 May 09:23:08 UTC 2019 Notes URLs: Additional Info: connect to address 172.16.5.112 and port 1970: Connection refused ___

[Betacluster-alerts] ** RECOVERY alert - deployment-fluorine02/Free space - all mounts is OK **

2019-05-01 Thread Shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-fluorine02 Address: 172.16.5.71 State: OK Date/Time: Wed 01 May 07:09:52 UTC 2019 Notes URLs: Additional Info: OK: All targets OK ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - deployment-fluorine02/Free space - all mounts is CRITICAL **

2019-05-01 Thread Shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-fluorine02 Address: 172.16.5.71 State: CRITICAL Date/Time: Wed 01 May 05:14:52 UTC 2019 Notes URLs: Additional Info: CRITICAL: deployment-prep.deployment-fluorine02.diskspace._srv.byte_percentfree (<40.00%)