nada-attia opened a new issue, #18149: URL: https://github.com/apache/hudi/issues/18149
### Describe the problem you faced When rollback operations fail in Hudi, there is no metric emitted to track these failures. This makes it difficult to monitor and alert on rollback issues in production environments. ### Describe the solution you'd like Add a new metric `rollback.failure` that is emitted when a rollback operation fails. Additionally, emit a metric with the exception reason for more granular tracking. ### Describe alternatives you've considered Relying solely on logs for rollback failure detection, but metrics provide better integration with monitoring and alerting systems. ### Additional context Implementation PR: #18148 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
