Wei-Chiu Chuang created HDDS-14807:
--------------------------------------
Summary: [Docs] Erasure Coding related metrics
Key: HDDS-14807
URL: https://issues.apache.org/jira/browse/HDDS-14807
Project: Apache Ozone
Issue Type: Wish
Components: documentation
Reporter: Wei-Chiu Chuang
Add the metrics related to Erasure Coding to the
Core Concepts -> Replication -> Erasure Coding page:
https://ozone.apache.org/docs/core-concepts/replication/erasure-coding
These metrics span the Datanode, SCM, and client, and are useful for
troubleshooting EC problems. Document them in tables that give, for each
metric, its name in JMX, its name in Prometheus, and a description.
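Since each metric needs both a JMX name and a Prometheus name, a small helper can draft the Prometheus column from the class and field names listed in this issue. The sketch below assumes the common convention of joining the metrics class name and the metric name and converting camelCase to snake_case. The function name prometheus_style_name is hypothetical, and the generated names are only candidates that should be checked against the actual Prometheus output of each service before they go into the docs.

```python
import re


def prometheus_style_name(record_name: str, metric_name: str) -> str:
    """Draft a snake_case name from a metrics class name and a metric name.

    Assumption: the exporter joins the two names and splits on camelCase
    boundaries. Verify the result against the real endpoint output.
    """
    def capitalize(s: str) -> str:
        return s[:1].upper() + s[1:]

    base = capitalize(record_name) + capitalize(metric_name)
    # Boundary between a lowercase letter or digit and an uppercase letter.
    base = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", "_", base)
    # Boundary between an acronym and the next capitalized word (EC|Data).
    base = re.sub(r"(?<=[A-Z])(?=[A-Z][a-z])", "_", base)
    return base.lower()


if __name__ == "__main__":
    # Class and metric names taken from the lists in this issue.
    candidates = [
        ("ECReconstructionMetrics", "blockGroupReconstructionTotal"),
        ("XceiverClientMetrics", "ecReconstructionTotal"),
        ("ReplicationManagerMetrics", "InflightEcReplication"),
    ]
    for record, metric in candidates:
        print(f"{record}.{metric} -> {prometheus_style_name(record, metric)}")
```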
Datanode: ECReconstructionMetrics
* blockGroupReconstructionTotal
* blockGroupReconstructionFailsTotal
* reconstructionTotal
* reconstructionFailsTotal
Client: XceiverClientMetrics
* ecReconstructionTotal
* ecReconstructionFailsTotal
SCM: ReplicationManagerMetrics
* InflightEcReplication
* InflightEcDeletion
* ecReplicationCmdsSentTotal
* ecReconstructionCmdsSentTotal
* ecReplicasCreatedTotal
* ecReplicasDeletedTotal
* ecReplicaCreateTimeoutTotal
* ecReplicaDeleteTimeoutTotal
* ecPartialReconstructionSkippedTotal
* ecPartialReconstructionCriticalTotal
* ecPartialReconstructionNoneOverloadedTotal
* ecPartialReplicationForOutOfServiceReplicasTotal
SCM: SafeModeMetrics
* numContainerWithECDataReplicaReportedThreshold
* currentContainersWithECDataReplicaReportedCount
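
To fill in the JMX column, it may help to pull the listed metrics out of a JMX JSON dump. The sketch below assumes the dump has the usual shape of a top-level "beans" list whose entries carry a "name" plus one attribute per metric. The helper select_metrics, the bean name, and all values in SAMPLE are made up for illustration. Attribute capitalization in JMX may differ from the field names listed above, so the match is case-insensitive.

```python
import json


def select_metrics(jmx_doc: dict, source_name: str, wanted: list) -> dict:
    """Return the wanted attributes from beans whose name mentions source_name.

    Assumption: jmx_doc looks like {"beans": [{"name": ..., attr: value}]}.
    Attribute names are compared case-insensitively.
    """
    wanted_lower = {w.lower() for w in wanted}
    found = {}
    for bean in jmx_doc.get("beans", []):
        if source_name not in bean.get("name", ""):
            continue
        for attr, value in bean.items():
            if attr.lower() in wanted_lower:
                found[attr] = value
    return found


# Hypothetical sample; the bean name and the numbers are not real output.
SAMPLE = json.loads("""
{
  "beans": [
    {
      "name": "Hadoop:service=ExampleService,name=ECReconstructionMetrics",
      "ReconstructionTotal": 12,
      "ReconstructionFailsTotal": 1,
      "UnrelatedCounter": 5
    }
  ]
}
""")

if __name__ == "__main__":
    result = select_metrics(
        SAMPLE,
        "ECReconstructionMetrics",
        ["reconstructionTotal", "reconstructionFailsTotal"],
    )
    for name, value in result.items():
        print(f"{name} = {value}")
```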