Can anybody point me to more info on how automatic scaling works when using websockets?
https://cloud.google.com/appengine/docs/flexible/python/how-instances-are-managed mentions automatic scaling is done based on response latency and request rate. But my app engine app is only serving websockets and thus the request rate and number of simultaneous request in-flight are less relevant. And when scaling down, will the instance with the least number of connections be killed or how does that selection process work? And when scaling up but the new instance is still warming up, will the websocket-connection go to one of the already existing instances or will the request receive an error? Thanks in advance for clarifying toon -- You received this message because you are subscribed to the Google Groups "Google App Engine" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/google-appengine/c29ba730-a0ac-49f9-a5e7-aaba1ddd5d52%40googlegroups.com.
