CRZbulabula commented on PR #17821: URL: https://github.com/apache/iotdb/pull/17821#issuecomment-4656212363
@Caideyipi Thanks for the detailed review. Fixed in the latest commits, especially f54b4847f0.

1. AINode registration now treats CONFIG_NODE_LEADER_WARMING_UP as retryable instead of a hard failure.
2. Non-seed ConfigNode registration now waits and retries when the leader returns CONFIG_NODE_LEADER_WARMING_UP.
3. ConfigRegionStateMachine now uses a leader-services epoch guard, serializes startup and cleanup with leaderServicesLock, and re-checks the epoch before marking leader services ready.
4. DataNode registration now uses a 60s warm-up retry deadline, so it keeps retrying past the 30s first-heartbeat tolerance.

I also addressed the follow-up warm-up sampling concerns:

- Removed the unreported DataNode Region heartbeat chain.
- Removed the extra DataNodeHeartbeatHandler region-group argument.
- Restricted consensus sampling so that it caches leader samples only when the DataNode reports leader=true with a consensus logical timestamp.
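
For readers following item 3, the epoch-guard pattern can be sketched roughly as below. This is a minimal illustration, not the code in the PR: only the name `leaderServicesLock` comes from the comment above, while `LeaderServicesGuard`, `onBecomeLeader`, `onLoseLeadership`, and `isReady` are hypothetical names chosen for the example.

```java
import java.util.concurrent.atomic.AtomicLong;
import java.util.concurrent.locks.ReentrantLock;

/** Hypothetical sketch of an epoch guard; not the actual ConfigRegionStateMachine. */
final class LeaderServicesGuard {
  // Bumped on every leadership change so stale startups can detect they were superseded.
  private final AtomicLong epoch = new AtomicLong();
  // Serializes startup and cleanup of leader services.
  private final ReentrantLock leaderServicesLock = new ReentrantLock();
  private volatile boolean ready = false;

  /** Starts leader services; returns true only if they were marked ready. */
  boolean onBecomeLeader(Runnable startServices) {
    long myEpoch = epoch.incrementAndGet();
    leaderServicesLock.lock();
    try {
      if (epoch.get() != myEpoch) {
        return false; // superseded before startup began
      }
      startServices.run();
      if (epoch.get() != myEpoch) {
        return false; // leadership changed during startup, so do not mark ready
      }
      ready = true;
      return true;
    } finally {
      leaderServicesLock.unlock();
    }
  }

  /** Invalidates any in-flight startup, then stops leader services under the lock. */
  void onLoseLeadership(Runnable stopServices) {
    epoch.incrementAndGet();
    leaderServicesLock.lock();
    try {
      ready = false;
      stopServices.run();
    } finally {
      leaderServicesLock.unlock();
    }
  }

  boolean isReady() {
    return ready;
  }
}
```

The epoch is bumped before the lock is taken in `onLoseLeadership`, so a startup that is already running sees the change at its re-check and never marks the services ready; the cleanup then runs once the lock is free.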
