smallzhongfeng commented on issue #127: URL: https://github.com/apache/incubator-uniffle/issues/127#issuecomment-1205999294
The reason is that the `ShuffleServer` returns the heartbeat at a frequency of 10s each time. The `Coordinator` filters the `ShuffleServer` whose available memory is lower than the threshold value, which may cause the `ShuffleServer` to be lower than the allocable quantity. In this way, it is pushed down to SSM, add a retry logic and two parameters, retry times and time. Within the user's expected retry time, run on RSS as much as possible.WDYT? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
