xyuanlu opened a new issue #1291: URL: https://github.com/apache/helix/issues/1291
### Describe the bug The controller switch time for large scale cluster is slow. It takes around ~60s for a standby controller to obtain leadership of a cluster with ~50 participants and ~700 resources. The service's availability is impacted under this period of time. ### To Reproduce This issue can be constantly reproduced in large cluster set up. ### Expected behavior The ultimate goal would be reducing the leadership handoff time to ~10s. ### Additional context The performance improvement is a long process. We should do small improvements one at a time and achieve the final goal step by step. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
