XComp commented on PR #22052: URL: https://github.com/apache/flink/pull/22052#issuecomment-1471786148
Thanks @zentol for sharing your thoughts. The two OOM errors reported in FLINK-31278 happened on Azure-owned hosts rather than on the Alibaba machines. I understand that for the latter, we can't rely on the failed module actually being the cause of the error, because other modules are executed concurrently on the same Alibaba machine. For the Azure-owned agents, however, I assumed that this is not the case: I would expect these agents to be properly isolated from each other. Is that assumption wrong?
