adoroszlai commented on PR #7541: URL: https://github.com/apache/ozone/pull/7541#issuecomment-2536685260
Thanks @slfan1989 for sharing details. > Currently, my approach is to manually compare the DataNode lists from the two SCMs, identify the DataNodes that failed to register, track these DataNodes, and take appropriate actions (in most cases, this involves restarting the DataNodes). > > The purpose of retrieving the DataNode list from the Pipeline List is to identify any unregistered DataNodes, as shown in my screenshot. We could implement new (or improve existing) `ozone admin` command to help with that. It may need backend changes, but not `DataNodeSafeModeRule`. > 2. The reason for improving the DataNodeSafeModeRule is that this rule is difficult to apply effectively in real-world usage. I think the sentiment here is that other safemode rules should help with that. `DataNodeSafeModeRule` is kind of a "minimum bar" for the cluster to pass. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
