errose28 commented on PR #7499: URL: https://github.com/apache/ozone/pull/7499#issuecomment-2512559133
> We change the default value to 0, user still can modify it to some other value, for example "1", based on their own needs.

We should strive for intuitive configurations out of the box to help usability as much as possible. IMO `-1` is expected behavior and will not lead to a surprise node shutdown.

> -1 means even 19 disks fails, the datanode is still running, the administrator will not know that if he doesn't pay attention to that.

The DNs should be monitored for disk failures, and those failures should be investigated. It should not take a DN shutdown to make that happen. This is the real problem: we do not have a good alerting system for disk failures. A new Recon page (HDDS-11840), dashboards, and changes like #7266 can all remediate this problem without introducing potentially surprising config changes.
