[Betacluster-alerts] Host DOWN alert for deployment-tmh01!

2018-03-05 Thread shinken
Notification Type: PROBLEM Host: deployment-tmh01 State: DOWN Address: 10.68.16.211 Info: CRITICAL - Host Unreachable (10.68.16.211) Date/Time: Mon 05 Mar 10:54:51 UTC 2018 ___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] Host DOWN alert for deployment-videoscaler01!

2018-03-05 Thread shinken
Notification Type: PROBLEM Host: deployment-videoscaler01 State: DOWN Address: 10.68.19.130 Info: CRITICAL - Host Unreachable (10.68.19.130) Date/Time: Mon 05 Mar 10:54:06 UTC 2018

[Betacluster-alerts] ** PROBLEM alert - deployment-ores01/Puppet errors is CRITICAL **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-ores01 Address: 10.68.23.165 State: CRITICAL Date/Time: Mon 05 Mar 12:30:52 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Mon 05 Mar 13:30:28 UTC 2018 Notes URLs: Additional Info: WARNING: 80.00% of data above the warning threshold [1.0]

[Betacluster-alerts] Host DOWN alert for deployment-puppetdb01!

2018-03-05 Thread shinken
Notification Type: PROBLEM Host: deployment-puppetdb01 State: DOWN Address: 10.68.23.76 Info: CRITICAL - Host Unreachable (10.68.23.76) Date/Time: Mon 05 Mar 14:53:14 UTC 2018

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki04/Free space - all mounts is CRITICAL **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-mediawiki04 Address: 10.68.19.128 State: CRITICAL Date/Time: Mon 05 Mar 16:32:48 UTC 2018 Notes URLs: Additional Info: CRITICAL: deployment-prep.deployment-mediawiki04.diskspace.root.byte_percentfree (<55.56%)

[Betacluster-alerts] ** PROBLEM alert - deployment-logstash2/Puppet errors is CRITICAL **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-logstash2 Address: 10.68.16.147 State: CRITICAL Date/Time: Mon 05 Mar 16:33:16 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki05/Free space - all mounts is WARNING **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-mediawiki05 Address: 10.68.22.21 State: WARNING Date/Time: Mon 05 Mar 16:33:46 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-mediawiki05.diskspace.root.byte_percentfree (<11.11%)

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki05/Free space - all mounts is OK **

2018-03-05 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-mediawiki05 Address: 10.68.22.21 State: OK Date/Time: Mon 05 Mar 16:38:47 UTC 2018 Notes URLs: Additional Info: OK: All targets OK

[Betacluster-alerts] beta-scap-eqiad - Build # 198326 - Failure!

2018-03-05 Thread jenkins-bot
beta-scap-eqiad - Build # 198326 - Failure: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/198326/ to view the results.

[Betacluster-alerts] beta-scap-eqiad - Build # 198327 - Fixed!

2018-03-05 Thread jenkins-bot
beta-scap-eqiad - Build # 198327 - Fixed: Check console output at https://integration.wikimedia.org/ci/job/beta-scap-eqiad/198327/ to view the results.

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Free space - all mounts is WARNING **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-tin Address: 10.68.21.205 State: WARNING Date/Time: Mon 05 Mar 18:36:39 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-tin.diskspace._mnt.byte_percentfree (No valid datapoints

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki07/App Server Main HTTP Response is CRITICAL **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki07 Address: 10.68.17.40 State: CRITICAL Date/Time: Mon 05 Mar 18:34:35 UTC 2018 Notes URLs: Additional Info: HTTP CRITICAL: HTTP/1.1 500 hphp_invoke - string 'Wikipedia' not found on

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2018-03-05 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Mon 05 Mar 19:09:28 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Free space - all mounts is CRITICAL **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-tin Address: 10.68.21.205 State: CRITICAL Date/Time: Mon 05 Mar 20:01:40 UTC 2018 Notes URLs: Additional Info: CRITICAL: deployment-prep.deployment-tin.diskspace._mnt.byte_percentfree (No valid datapoints

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Mon 05 Mar 19:21:30 UTC 2018 Notes URLs: Additional Info: WARNING: 40.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki04/Free space - all mounts is WARNING **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-mediawiki04 Address: 10.68.19.128 State: WARNING Date/Time: Mon 05 Mar 19:53:49 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-mediawiki04.diskspace.root.byte_percentfree (<11.11%)

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki04/Free space - all mounts is OK **

2018-03-05 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-mediawiki04 Address: 10.68.19.128 State: OK Date/Time: Mon 05 Mar 20:03:46 UTC 2018 Notes URLs: Additional Info: OK: All targets OK

[Betacluster-alerts] ** PROBLEM alert - deployment-secureredirexperiment/Puppet errors is CRITICAL **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-secureredirexperiment Address: 10.68.17.132 State: CRITICAL Date/Time: Mon 05 Mar 18:54:27 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Mon 05 Mar 18:54:30 UTC 2018 Notes URLs: Additional Info: WARNING: 80.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2018-03-05 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Mon 05 Mar 13:45:28 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Mon 05 Mar 13:57:29 UTC 2018 Notes URLs: Additional Info: WARNING: 40.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki04/Free space - all mounts is CRITICAL **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-mediawiki04 Address: 10.68.19.128 State: CRITICAL Date/Time: Mon 05 Mar 20:14:48 UTC 2018 Notes URLs: Additional Info: CRITICAL: deployment-prep.deployment-mediawiki04.diskspace.root.byte_percentfree (<11.11%)

[Betacluster-alerts] ** PROBLEM alert - deployment-mx02/Puppet errors is CRITICAL **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mx02 Address: 10.68.23.220 State: CRITICAL Date/Time: Mon 05 Mar 21:08:21 UTC 2018 Notes URLs: Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Free space - all mounts is WARNING **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-tin Address: 10.68.21.205 State: WARNING Date/Time: Mon 05 Mar 20:11:39 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-tin.diskspace._mnt.byte_percentfree (No valid datapoints

[Betacluster-alerts] ** PROBLEM alert - deployment-snapshot01/Free space - all mounts is WARNING **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-snapshot01 Address: 10.68.19.94 State: WARNING Date/Time: Mon 05 Mar 20:17:33 UTC 2018 Notes URLs: Additional Info: WARNING: deployment-prep.deployment-snapshot01.diskspace._data.byte_percentfree (No valid

[Betacluster-alerts] ** RECOVERY alert - deployment-snapshot01/Free space - all mounts is OK **

2018-03-05 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-snapshot01 Address: 10.68.19.94 State: OK Date/Time: Mon 05 Mar 20:22:32 UTC 2018 Notes URLs: Additional Info: OK: deployment-prep.deployment-snapshot01.diskspace._data.byte_percentfree (No valid datapoints found)

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki04/Free space - all mounts is OK **

2018-03-05 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-mediawiki04 Address: 10.68.19.128 State: OK Date/Time: Mon 05 Mar 20:24:47 UTC 2018 Notes URLs: Additional Info: OK: All targets OK

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Tue 06 Mar 04:53:30 UTC 2018 Notes URLs: Additional Info: WARNING: 80.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-aqs01/Puppet errors is CRITICAL **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-aqs01 Address: 10.68.18.237 State: CRITICAL Date/Time: Tue 06 Mar 05:10:55 UTC 2018 Notes URLs: Additional Info: CRITICAL: 44.44% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is CRITICAL **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: CRITICAL Date/Time: Tue 06 Mar 04:48:29 UTC 2018 Notes URLs: Additional Info: CRITICAL: 20.00% of data above the critical threshold [10.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is CRITICAL **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: CRITICAL Date/Time: Tue 06 Mar 04:58:28 UTC 2018 Notes URLs: Additional Info: CRITICAL: 60.00% of data above the critical threshold [10.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Tue 06 Mar 04:43:29 UTC 2018 Notes URLs: Additional Info: WARNING: 40.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-aqs01/Puppet errors is OK **

2018-03-05 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-aqs01 Address: 10.68.18.237 State: OK Date/Time: Tue 06 Mar 05:45:53 UTC 2018 Notes URLs: Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2018-03-05 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Tue 06 Mar 05:03:29 UTC 2018 Notes URLs: Additional Info: WARNING: 80.00% of data above the warning threshold [1.0]