There was heavy load on GCE and we hit disk use limit (24MB/sec for 50GB drive), possibly while trying to swap or restart containers. The result apart from several minutes downtime was that automatic recovery stalled on 2 out of 3 instances. I will try to prevent this stalling from happening in the future and for now manually restarted 2 nodes.
-- You received this message because you are subscribed to the Google Groups "sage-cell" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/sage-cell/ee12f1d2-e473-433a-9772-a888addde642%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
