Has anyone else noticed Amazon AWS becoming less and less reliable in the last several months?
It is getting really annoying to me. There are many things I've observed, but this is the easiest one to talk about: We have a zabbix server hosted in amazon, monitoring all the other systems in amazon, as well as on our private LAN, and other datacenters. I find nowadays, that we can't go a single day without triggering alerts, "PROBLEM: Zabbix agent on ____ is unreachable for 3 minutes". Followed immediately by "RECOVERED" a minute or two later. Usually, the alerts are 100% inside of Amazon, in fact, 100% in Virginia. Sure occasionally an alert about other datacenters - When there's an alert outside of AWS, it's usually some random network provider cutting the internet in the middle of the night (we see half a dozen systems trigger alerts at the same time and then come back at the same time). But those only occur, I'll guess monthly, while like I said, the AWS alerts are happening typically on a daily basis, at random times of day or night. I almost missed a real alert buried in all the noise. Side note: There's one company I work for, that was able to switch OFF of AWS. We went to digital ocean, where everything is 2-4 times faster, requires 1/4 as much work, and costs half as much. So it was a big win. Unfortunately the company experiencing the above is firmly entrenched in AWS and would be extremely difficult to relocate.
_______________________________________________ Tech mailing list Tech@lists.lopsa.org https://lists.lopsa.org/cgi-bin/mailman/listinfo/tech This list provided by the League of Professional System Administrators http://lopsa.org/