gaoyajun02 commented on PR #3261: URL: https://github.com/apache/celeborn/pull/3261#issuecomment-2999300100
> * we have apps with **100k-600k mappers in a single stage (and multiple such stages)** that have been running reliably and performantly

Additionally, we still have concerns about high-concurrency scenarios (executor * cores). We have applied end-to-end consistency validation in production, and we have recently been analyzing cases of Celeborn RPC timeouts on the driver (500k mappers, 8000*2 cores), which may not necessarily be related to this change.
