[Betacluster-alerts] ** PROBLEM alert - deployment-cache-text04/Puppet errors is CRITICAL **

2018-06-13 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cache-text04 Address: 10.68.18.103 State: CRITICAL Date/Time: Wed 13 Jun 21:43:24 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-deploy-01/SSH is CRITICAL **

2018-06-13 Thread shinken
Notification Type: PROBLEM Service: SSH Host: deployment-deploy-01 Address: 10.68.22.177 State: CRITICAL Date/Time: Wed 13 Jun 20:09:47 UTC 2018 Notes URLs: Additional Info: Connection refused ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-06-13 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 13 Jun 19:53:46 UTC 2018 Notes URLs: Additional Info: WARNING: 20.00% of data above the warning threshold [1.0]

[Betacluster-alerts] Host DOWN alert for deployment-mx!

2018-06-13 Thread shinken
Notification Type: PROBLEM Host: deployment-mx State: DOWN Address: 10.68.17.78 Info: CRITICAL - Host Unreachable (10.68.17.78) Date/Time: Wed 13 Jun 19:51:23 UTC 2018 ___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** RECOVERY alert - deployment-maps03/Puppet errors is OK **

2018-06-13 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-maps03 Address: 10.68.18.91 State: OK Date/Time: Wed 13 Jun 19:28:52 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2018-06-13 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Wed 13 Jun 19:07:47 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-06-13 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 13 Jun 19:02:46 UTC 2018 Notes URLs: Additional Info: WARNING: 20.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2018-06-13 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Wed 13 Jun 18:56:46 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [1.0] ___

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-06-13 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Wed 13 Jun 18:51:49 UTC 2018 Notes URLs: Additional Info: WARNING: 20.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-kafka-jumbo-1/Puppet errors is OK **

2018-06-13 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-kafka-jumbo-1 Address: 10.68.23.243 State: OK Date/Time: Wed 13 Jun 18:41:01 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka-jumbo-1/Puppet errors is CRITICAL **

2018-06-13 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-kafka-jumbo-1 Address: 10.68.23.243 State: CRITICAL Date/Time: Wed 13 Jun 18:15:59 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Free space - all mounts is WARNING **

2018-06-13 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-tin Address: 10.68.21.205 State: WARNING Date/Time: Wed 13 Jun 18:11:11 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-tin.diskspace._mnt.byte_percentfree (No valid datapoints

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Free space - all mounts is CRITICAL **

2018-06-13 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-tin Address: 10.68.21.205 State: CRITICAL Date/Time: Wed 13 Jun 18:01:09 UTC 2018 Notes URLs: Additional Info: CRITICAL: deployment-prep.deployment-tin.diskspace._mnt.byte_percentfree (No valid datapoints

[Betacluster-alerts] ** RECOVERY alert - deployment-cpjobqueue/Puppet errors is OK **

2018-06-13 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-cpjobqueue Address: 10.68.22.161 State: OK Date/Time: Wed 13 Jun 17:24:11 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts

[Betacluster-alerts] ** PROBLEM alert - deployment-puppetmaster03/Long lived cherry-picks on puppetmaster is CRITICAL **

2018-06-13 Thread shinken
Notification Type: PROBLEM Service: Long lived cherry-picks on puppetmaster Host: deployment-puppetmaster03 Address: 10.68.23.29 State: CRITICAL Date/Time: Wed 13 Jun 16:09:36 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-maps03/Puppet errors is CRITICAL **

2018-06-13 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-maps03 Address: 10.68.18.91 State: CRITICAL Date/Time: Wed 13 Jun 13:18:45 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0] ___

[Betacluster-alerts] ** RECOVERY alert - deployment-aqs01/Puppet staleness is OK **

2018-06-13 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-aqs01 Address: 10.68.18.237 State: OK Date/Time: Wed 13 Jun 11:47:30 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-cpjobqueue/Puppet errors is CRITICAL **

2018-06-13 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-cpjobqueue Address: 10.68.22.161 State: CRITICAL Date/Time: Wed 13 Jun 09:07:23 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-aqs01/Puppet staleness is WARNING **

2018-06-13 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-aqs01 Address: 10.68.18.237 State: WARNING Date/Time: Wed 13 Jun 09:02:31 UTC 2018 Notes URLs: Additional Info: WARNING: 66.67% of data above the warning threshold [3600.0] ___

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Free space - all mounts is WARNING **

2018-06-13 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-tin Address: 10.68.21.205 State: WARNING Date/Time: Wed 13 Jun 08:51:10 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-tin.diskspace._mnt.byte_percentfree (No valid datapoints