LittleBoy18 opened a new pull request, #9897: URL: https://github.com/apache/rocketmq/pull/9897
### Brief Description

Version: 5.3.2

Deployment mode: controller mode, one master and three slaves

Steps to reproduce: A producer continuously writes messages to the cluster while a network-delay fault (1 minute of delay) is injected on two of the brokers. The client keeps receiving FLUSH_SLAVE_TIMEOUT for far longer than 20s. (By default, a slave broker whose heartbeat is not received within 20s should be removed from the syncStateSet, after which the client should recover.)

<img width="2700" height="786" alt="image" src="https://github.com/user-attachments/assets/c3e2cd66-2700-4a99-8801-a82dddd25f01" />

The following log entry also appears in store.log on the broker side:

<img width="1044" height="138" alt="image" src="https://github.com/user-attachments/assets/b6a770d0-177e-42f9-bcac-c8f94be0712a" />

The likely cause is a race condition in removeConnection: when two broker connections are dropped at the same time, both removals modify the local syncStateSet concurrently, which prolongs the outage.
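The suspected race can be illustrated with a minimal, self-contained sketch. This is not the RocketMQ implementation: the class `SyncStateSetSketch` and all of its methods are hypothetical names invented for illustration, and it assumes the local set is updated with a copy-then-publish pattern. Under that assumption, if two removals each copy the set before either publishes its result, the second publish overwrites the first and one disconnected slave stays in the set.

```java
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical stand-in for a master's local view of its in-sync replicas.
// None of these names are taken from the RocketMQ code base.
final class SyncStateSetSketch {
    private final ReentrantLock lock = new ReentrantLock();
    private volatile Set<Long> syncStateSet;

    SyncStateSetSketch(Set<Long> initial) {
        this.syncStateSet = Collections.unmodifiableSet(new HashSet<>(initial));
    }

    Set<Long> current() {
        return syncStateSet;
    }

    // Step 1 of the unsafe path: copy the current set and drop one broker.
    Set<Long> copyWithout(long brokerId) {
        Set<Long> copy = new HashSet<>(syncStateSet);
        copy.remove(brokerId);
        return copy;
    }

    // Step 2 of the unsafe path: publish a previously computed copy.
    void publish(Set<Long> newSet) {
        syncStateSet = Collections.unmodifiableSet(newSet);
    }

    // Safe path: the whole read-modify-write runs as one critical section.
    void removeAtomically(long brokerId) {
        lock.lock();
        try {
            publish(copyWithout(brokerId));
        } finally {
            lock.unlock();
        }
    }

    // Replays, on one thread, the interleaving in which both removals
    // copy the set before either of them publishes.
    static Set<Long> lostUpdateDemo() {
        SyncStateSetSketch s = new SyncStateSetSketch(Set.of(1L, 2L, 3L, 4L));
        Set<Long> a = s.copyWithout(2L); // sees {1, 2, 3, 4}
        Set<Long> b = s.copyWithout(3L); // also sees {1, 2, 3, 4}
        s.publish(a);                    // set is now {1, 3, 4}
        s.publish(b);                    // overwritten with {1, 2, 4}
        return s.current();
    }

    // Runs both removals on separate threads through the locked path.
    static Set<Long> serializedDemo() {
        SyncStateSetSketch s = new SyncStateSetSketch(Set.of(1L, 2L, 3L, 4L));
        Thread t1 = new Thread(() -> s.removeAtomically(2L));
        Thread t2 = new Thread(() -> s.removeAtomically(3L));
        t1.start();
        t2.start();
        try {
            t1.join();
            t2.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return s.current();
    }
}
```

In this sketch, broker 1 plays the master and brokers 2 and 3 are the slaves whose connections drop together. `lostUpdateDemo()` leaves broker 2 in the set even though it was removed, which would keep a master waiting on a slave that is already gone. `serializedDemo()` removes both regardless of thread scheduling. Whether a lock, a compare-and-set loop, or another mechanism is the right fix for the real code depends on how the actual update path is structured.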
