zuston opened a new issue, #234:
URL: https://github.com/apache/incubator-uniffle/issues/234

   ### Background
   When changing some coordinator's conf and then restart, coordinator will 
accept client `getAssignment` request immediately, but it will serve for jobs 
request based on the partial registered shuffle-servers, which will make some 
jobs gotten not enough required shuffle-servers and then slow the running speed.
   
   I think we should make coordinator wait for more than one shuffle-server 
heartbeat interval before serving for client. During out-of-service, requests 
from client will fallback to slave coordinator.
   
   Besides, I think this rejection mechanism could be enabled by the 
coordinator conf.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@uniffle.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to