Thank you for sharing your experience with us. You are saying that your machine workload does not match the machine type requirement and that you didn’t expect to upgrade to a class F2 instance. I will try to help.
There are some ways to mitigate this error message [1] such as: 1. Predict spikes and preemptively load instances. Warmup requests [2] are designed specifically to combat situations that involve predictable frequent sudden spikes. Warmup requests would "know" when they expect spikes so it preemptively load up instances for you to avoid cold booting during spikes. 2. Make cold boots faster [3]. You can make cold boot loading faster by having less complex code with less libraries that need to be loaded. There is an interesting article [4] on how to improve the loading performance. 3. Provisioning more resources is one of the easier solutions such as idle instances to avoid cold boots, however this might not be ideal as it could increase your costs. 4. Retry strategy if your app can accept x amount of transient failures (within our SLO), then you can simply catch those failures with a retry and your app can function without any issue. Lastly, I would like to concur David’s suggestion to raise an official case with the GCP support [5] as the error message [1] can be due to multiple reasons (e.g sudden spiky traffic, backend issues, etc) and we do have the tools to diagnose such issues. With the right diagnosis of your case, we can determine the recommendations which might avoid resorting to changing the machine type and increasing your costs. [1] "logMessage": "Request was aborted after waiting too long to attempt to service your request." [2] https://cloud.google.com/appengine/docs/standard/go111/configuring-warmup-requests [3] https://cloud.google.com/appengine/docs/standard/go/how-instances-are-managed#loading_requests [4] https://cloud.google.com/blog/products/gcp/best-practices-for-app-engine-startup-time-google-cloud-performance-atlas [5] https://cloud.google.com/support-hub On Tuesday, March 8, 2022 at 6:33:43 PM UTC-5 Viktor Bresan wrote: > I had similar problem few months ago when suddenly extra instances were > created for another app that wasn't receiving any extra traffic. And I am > not the only one who experienced that. Something is obviously happening on > the google side, it's ridiculous that I can't serve a simple request with > F1 instance. > > And it would be ridiculous to spend $29 a month for support while the > total running cost of my apps is $0.08 a month. > > > On Monday, March 7, 2022 at 10:59:04 PM UTC+1 David (Cloud Platform > Support) wrote: > >> Glad to hear that switching to a F2 instance class fixed the issue. It’s >> hard to say whether the issue was caused by a lack of resources even after >> looking at the log you provided. Which is why I would recommend you to >> contact GCP support <https://cloud.google.com/support-hub> If this issue >> happens again even after having upgraded instance type, since they can >> inspect your GAE service and provide you with more useful information. >> >> On Saturday, March 5, 2022 at 12:11:16 PM UTC-5 Viktor Bresan wrote: >> >>> >>> Many thanks for your tips! The trace is not available for failed >>> request, for all other it is. I haven't pasted log earlier, perhaps I >>> should, because (in my opinion) it isn't showing anything. Here it is >>> (attached). >>> >>> Meanwhile, I have switched to instance class F2 and so far the problem >>> did not happen again. I don't have latency longer than 3s, when new process >>> is started. Though I don't think the problem should have happened earlier, >>> or F1 is useless. >>> >>> >>> -- You received this message because you are subscribed to the Google Groups "Google App Engine" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/google-appengine/3b582dfa-e856-424e-b8ac-acc8658f8faan%40googlegroups.com.
