nada-attia opened a new issue, #18149:
URL: https://github.com/apache/hudi/issues/18149

   ### Describe the problem you faced
   
   When rollback operations fail in Hudi, there is no metric emitted to track 
these failures. This makes it difficult to monitor and alert on rollback issues 
in production environments.
   
   ### Describe the solution you'd like
   
   Add a new metric `rollback.failure` that is emitted when a rollback 
operation fails. Additionally, emit a metric with the exception reason for more 
granular tracking.
   
   ### Describe alternatives you've considered
   
   Relying solely on logs for rollback failure detection, but metrics provide 
better integration with monitoring and alerting systems.
   
   ### Additional context
   
   Implementation PR: #18148


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to