LittleBoy18 opened a new pull request, #9843:
URL: https://github.com/apache/rocketmq/pull/9843

   ### Brief Description
   代码健壮性增强
   
   ### How Did You Test This Change?
   部署:
   rocketmq 5.3.0部署controller模式
   一主三从
   
   故障测试:
   对其中两个broker做网络延迟故障,网络延迟在1分钟以上。故障恢复发现始终有一个broker不能加入到SyncStateSet里
   
   代码改造后测试:
   
ReplicasManager的scanControllerAddress线程,在有网络延迟的情况相下确实有可能挂掉,影响AvailableControllerAddress的维护导致AvailableControllerAddress并不包含所有的controller节点,进而影响broker上报心跳到AvailableControllerAddress下的所有controller节点
   <img width="1913" height="329" alt="image" 
src="https://github.com/user-attachments/assets/c855d862-5e07-4bb6-be27-8f27195aaa27";
 />
   
   
   <!-- In order to ensure the code quality of Apache RocketMQ, we expect every 
pull request to have undergone thorough testing. -->
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to