neils-dev commented on PR #3781: URL: https://github.com/apache/ozone/pull/3781#issuecomment-1279598223
Thanks @sodonnel for your help to expose the decommission / maintenance metrics for monitoring the workflow. As you suggested, I've added metrics to monitor the workflow progress by host. These host based metrics are created dynamically and track the pipeline and container state for datanodes going through the decommissioning and maintenance workflow. The metrics include, node_decommission_metrics_tracked_pipelines_waiting_to_close_**ozone_datanode_3_ozone_default** node_decommission_metrics_tracked_sufficiently_replicated_**ozone_datanode_3_ozone_default** node_decommission_metrics_tracked_unhealthy_containers_**localhost** node_decommission_metrics_tracked_under_replicated_containers_**localhost** **(hostname marked in bold)** -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@ozone.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@ozone.apache.org For additional commands, e-mail: issues-h...@ozone.apache.org