I spoke too soon: the instance count has gone crazy again. I wonder whether this has something to do with the warmup requests.
I'm on GAE/Java with threadsafe enabled. The initial request to my app triggers a ServletContextListener, which takes more than 30 seconds to complete. That is indeed much higher than the configured max latency (I use the highest value, i.e. 15s). But the first request is a WARMUP request, so it shouldn't be used to evaluate the latency of an instance!

As the latency of every new instance is above the max value, additional instances are created as requests come in. As a result, after a deployment I get 40 fresh instances in just a few minutes, with only 1 request on each! And of course the instance count never decreases, even when the latency is stable around 500ms with an average QPS below 0.1. If I stop all traffic to the application, the instance count never goes back under 20 instances, which is already far too many for the needs of my app.

Maybe I'm completely wrong about this, but I think there is a real problem here, and it's costing us money.
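To make the suspicion concrete, here is a toy model of the behaviour described above. It is only a sketch of the hypothesis, not how the App Engine scheduler is known to work: the class `WarmupLatencyModel`, the method `instancesStarted`, and the rule "route to the first instance whose last observed latency is under the max, otherwise start a new one" are all assumptions made up for illustration. Only the numbers (30s startup, 15s max latency, 500ms steady latency, 40 requests) come from this post.

```java
import java.util.ArrayList;
import java.util.List;

public class WarmupLatencyModel {
    // Figures taken from the post; the scheduling rule below is an assumption.
    static final double MAX_LATENCY_S = 15.0; // configured max latency
    static final double STARTUP_S = 30.0;     // time spent in the ServletContextListener
    static final double STEADY_S = 0.5;       // latency once an instance is warm

    /**
     * Returns how many instances get started when `requests` arrive one at a time.
     * If countFirstRequest is true, the slow startup request is counted in the
     * instance's observed latency (the suspected behaviour). If false, it is
     * ignored (the behaviour expected for a warmup request).
     */
    static int instancesStarted(int requests, boolean countFirstRequest) {
        List<Double> lastLatency = new ArrayList<>(); // last observed latency per instance
        for (int r = 0; r < requests; r++) {
            int target = -1;
            for (int i = 0; i < lastLatency.size(); i++) {
                if (lastLatency.get(i) <= MAX_LATENCY_S) {
                    target = i; // first instance that looks healthy
                    break;
                }
            }
            if (target >= 0) {
                lastLatency.set(target, STEADY_S); // served by a warm instance
            } else {
                // No instance looks healthy, so a new one is started for this request.
                lastLatency.add(countFirstRequest ? STARTUP_S + STEADY_S : STEADY_S);
            }
        }
        return lastLatency.size();
    }

    public static void main(String[] args) {
        System.out.println("startup counted: " + instancesStarted(40, true));
        System.out.println("startup ignored: " + instancesStarted(40, false));
    }
}
```

Under these assumptions, counting the startup request means every instance looks too slow forever, so each of the 40 requests starts its own instance, which matches what I see after a deployment. Ignoring it leaves a single instance serving everything.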
