And here's what happens when the control plane itself falls over:

https://status.cloud.google.com/incident/zall/20011#20011006

It seems pretty clear that Cloud needs ANIMA.

Regards
   Brian

On 01-Dec-20 11:02, Brian E Carpenter wrote:
> "AWS reveals it broke itself by exceeding OS thread limits"
> 
> https://www.theregister.com/2020/11/30/aws_outage_explanation/
> 
> Especially:
> "The TIFU-like post also outlines why Amazon's dashboards offered only scanty 
> info about the incident – because they, too, depend on a service that depends 
> on Kinesis."
> 
> Perhaps there is something we should specify in ANIMA to prevent the ANIMA 
> infrastructure from falling into this sort of trap, where a system-wide 
> issue (such as hitting an O/S resource limit everywhere at the same time) 
> also prevents the autonomic mechanisms from working.
>  
> Regards
>    Brian Carpenter
> 

_______________________________________________
Anima mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/anima
