MiaNaNa opened a new issue #6233: URL: https://github.com/apache/dolphinscheduler/issues/6233
### Search before asking - [X] I had searched in the [issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and found no similar issues. ### What happened 现在我们在三台机器60,61,62。目前我们遇到了一个问题,61节点原为zk集群leader,出现异常终止,61节点不能使用,zk集群重新选举62(原为follower)为leader,DS却不能向新的zk leader进行注册通信(62节点),导致整个DS处于不可用状态,此时需要到DS的部署机(60)去手动重启DS服务,才可以恢复原来的运行状态(三台机器实现loadAverage),想请问下这个情况如何排查。 ### What you expected to happen 我们初步猜测可能是zk和DS服务部署在相同机器上的原因,但真实原因未知,且无从下手。 ### How to reproduce kill掉zk的leader结点,待重新选举成功后,服务不再为高可用状态。 ### Anything else _No response_ ### Are you willing to submit PR? - [X] Yes I am willing to submit a PR! ### Code of Conduct - [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
