devabhishekpal opened a new pull request, #7517:
URL: https://github.com/apache/ozone/pull/7517

   ## What changes were proposed in this pull request?
   HDDS-11680. Enhance Recon Metrics For Improved Observability
   
   Please describe your PR in detail:
   * Currently, Recon metrics reveal little about the tasks that Recon runs in 
the background. Only the last sequence number and the last updated time are 
recorded, and only for some metrics.
   * This PR introduces counters for the number of successful and failed 
executions of Recon tasks, plus a new configuration for how long the counts 
are retained before they are reset.
   * It also adds a new column to the RECON_TASK_STATUS table to store the last 
task status.
   * We also add an upgrade handler to ensure this column is consistent when 
upgrading from a previous version of Recon to a new one.
   * For SCM-based tasks such as ContainerHealthTask and PipelineSyncTask, we 
also add checks for queued-report metrics so that we can provide insight into 
these tasks.
   * We expose two new APIs as part of the TaskStatusService to fetch these new 
metrics.
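
As an illustration of the counter-with-reset behaviour described in the list above, here is a minimal sketch. It is not the code in this PR: the class name `TaskRunCounter`, its methods, and the millisecond window parameter are hypothetical, and the real change may store and reset the counts differently.

```java
import java.util.function.LongSupplier;

/**
 * Hypothetical sketch of a success/failure counter that resets after a
 * configured window. All names here are assumptions for illustration only.
 */
public class TaskRunCounter {
  private final long windowMillis;   // assumed: retention duration from config
  private final LongSupplier clock;  // injectable time source, in milliseconds
  private long windowStart;
  private long successes;
  private long failures;

  public TaskRunCounter(long windowMillis, LongSupplier clock) {
    this.windowMillis = windowMillis;
    this.clock = clock;
    this.windowStart = clock.getAsLong();
  }

  /** Records one successful task execution in the current window. */
  public synchronized void recordSuccess() {
    resetIfExpired();
    successes++;
  }

  /** Records one failed task execution in the current window. */
  public synchronized void recordFailure() {
    resetIfExpired();
    failures++;
  }

  public synchronized long getSuccesses() {
    resetIfExpired();
    return successes;
  }

  public synchronized long getFailures() {
    resetIfExpired();
    return failures;
  }

  /** Clears both counts once the configured window has elapsed. */
  private void resetIfExpired() {
    long now = clock.getAsLong();
    if (now - windowStart >= windowMillis) {
      successes = 0;
      failures = 0;
      windowStart = now;
    }
  }
}
```

In production the clock would typically be `System::currentTimeMillis`; passing it in as a parameter keeps the reset logic testable without sleeping.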
   
   
   ## What is the link to the Apache JIRA
   https://issues.apache.org/jira/browse/HDDS-11680
   
   ## How was this patch tested?
   The patch was tested with unit tests and by manually calling the APIs 
after building.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]