Can anybody point me to more info on how automatic scaling works when using 
websockets?

https://cloud.google.com/appengine/docs/flexible/python/how-instances-are-managed
 mentions 
automatic scaling is done based on response latency and request rate. But 
my app engine app is only serving websockets and thus the request rate and 
number of simultaneous request in-flight are less relevant.

And when scaling down, will the instance with the least number of 
connections be killed or how does that selection process work?

And when scaling up but the new instance is still warming up, will the 
websocket-connection go to one of the already existing instances or will 
the request receive an error?

Thanks in advance for clarifying

toon

-- 
You received this message because you are subscribed to the Google Groups 
"Google App Engine" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/google-appengine/c29ba730-a0ac-49f9-a5e7-aaba1ddd5d52%40googlegroups.com.

Reply via email to