Hi, We use GAE to host one of our Java apps. Recently we have been experiencing quite a lot of HTTP 500 errors and timeouts from App Engine. It appears that App Engine sometimes restarts all of our instances at the same time and does not wait for instances to finish serving requests. See the attached images:
<https://lh6.googleusercontent.com/-u9WUEwSEbHU/T1jSS0w6b1I/AAAAAAAAAC4/JWA396Qp7r4/s1600/request_time_graph.png> <https://lh3.googleusercontent.com/-NlOB_4lO5yQ/T1jSPHXyf8I/AAAAAAAAACw/q1x5-6KgoCg/s1600/new_instances.png> In the logs I see a lot of entries like this: "Request was aborted after waiting too long to attempt to service your request." The request time shown in the logs was typically around 10,000ms before the request was terminated. The screenshots were taken at 23.45 GMT on 1 March. We have since experienced similar issues. The latest being yesterday afternoon at 15.30 GMT. The one yesterday was very odd. It appears that the graphs in the dashboard all got reset when the instances restarted as if the entire app had been redeployed with no history at all. So my questions: 1) Can someone in Google please look into this for me? 2) Why would all instances be restarted at the same time? 3) Why wouldn't App Engine let the instances finish serving requests before restarting? 4) Is there any way I can elavate this issue? 5) Is there a better place to report this issue so someone in Google will investigate? This is very serious and we need to get to the bottom of it. Thanks, Hamish -- You received this message because you are subscribed to the Google Groups "Google App Engine" group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/E2c_GofputsJ. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
