zhuxt2015 opened a new issue, #10874: URL: https://github.com/apache/dolphinscheduler/issues/10874
### Search before asking - [X] I had searched in the [issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and found no similar feature requirement. ### Description ## 目前zookeeper的作用 1. 存储master和worker的服务相关信息,所在主机IP、端口、cpu负载等等 2. master和worker健康检测 3. master故障切换 4. 分布式锁 ## zookeeper带来的问题 1. 增加系统部署和运维的复杂度。 部署DS集群的同时, 还要部署一套zookeeper集群。还要能同时监控运维DS集群和zookeeper集群 1. 增加出错机率。DS集群和zookeeper集群之间网路出现抖动, 有可能出现错误。 ## 移除zookeeper的有点 1. 部署架构更简单 2. 维护成本更低 3.不再因为zookeeper集群出错导致DS报错 ## 移除zookeeper方案 <img width="700" alt="image" src="https://user-images.githubusercontent.com/13765310/178137541-39521953-39f9-40ff-9008-df983e66d1b8.png"> 1. 参考Kafka的KRaft方案,实现master之间的leader选举和数据同步 2. worker server向leader master注册和并与master保持心跳 3. Api server从leader master获取集群信息 ### Use case _No response_ ### Related issues #6680 ### Are you willing to submit a PR? - [X] Yes I am willing to submit a PR! ### Code of Conduct - [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
