[Discuss] Monitoring your AWS instances

2014-09-28 Thread Edward Ned Harvey (blu)
I would really like to hear from anybody else who has AWS machines, and alerting/monitoring of those systems (by a system other than Amazon's own monitoring system). The number of alerts I'm receiving about systems being unreachable and then becoming reachable again is ... Crazy to say the

Re: [Discuss] Monitoring your AWS instances

2014-09-28 Thread Chuck Anderson
On Sun, Sep 28, 2014 at 01:56:07PM +, Edward Ned Harvey (blu) wrote: I would really like to hear from anybody else who has AWS machines, and alerting/monitoring of those systems (by a system other than Amazon's own monitoring system). The number of alerts I'm receiving about systems

Re: [Discuss] Monitoring your AWS instances

2014-09-28 Thread Edward Ned Harvey (blu)
None of our systems are scheduled for reboot, *and* the machines triggering our alerts have not been rebooting - they just become unavailable on the network for a few minutes and then reappear, without any sort of crash or reboot or anything affecting uptime. *And* this has been a general

Re: [Discuss] Monitoring your AWS instances

2014-09-28 Thread Edward Ned Harvey (blu)
From: discuss-bounces+blu=nedharvey@blu.org [mailto:discuss-bounces+blu=nedharvey@blu.org] On Behalf Of Chuck Anderson Maybe this? https://aws.amazon.com/blogs/aws/ec2-maintenance-update/ I am aware of that - but they say less than 10% of their systems will be affected, and

Re: [Discuss] Monitoring your AWS instances

2014-09-28 Thread Matt Shields
Did you get an email telling you about reboots being scheduled? I know I have a number of systems being rebooted today around 2pm. If you log into the console, go to EC2, then click on Events on the left side, it will show you any that are scheduled in the future. If you change one of

Re: [Discuss] Monitoring your AWS instances

2014-09-28 Thread Jason Normand
We use Nagios and NRPE with a pretty standard monitoring setup. We also pull in CloudWatch data via Python goto scripts. As for the uptick, I would avoid making any changes until after the AWS maintenance. We have also noticed an uptick in alerts, though not to that level. However just because