Haibin Huang created HDFS-15745:
-----------------------------------
Summary: Make DataNodePeerMetrics#LOW_THRESHOLD_MS and
MIN_OUTLIER_DETECTION_NODES configurable
Key: HDFS-15745
URL: https://issues.apache.org/jira/browse/HDFS-15745
Project: Hadoop HDFS
Issue Type: Improvement
Reporter: Haibin Huang
Assignee: Haibin Huang
Attachments: image-2020-12-22-17-00-50-796.png
When i enable DataNodePeerMetrics to find slow slow peer in cluster, i found
there is a lot of slow peer but ReportingNodes's averageDelay is very low, and
these slow peer node are normal. I think the reason of why generating so many
slow peer is that the value of DataNodePeerMetrics#LOW_THRESHOLD_MS is too
small (only 5ms) and it is not configurable. The default value of slow io
warning log threshold is 300ms, i.e.
DFSConfigKeys.DFS_DATANODE_SLOW_IO_WARNING_THRESHOLD_DEFAULT = 300, so
DataNodePeerMetrics#LOW_THRESHOLD_MS should not be less than 300ms, otherwise
namenode will get a lot of invalid slow peer information.
!image-2020-12-22-17-00-50-796.png!
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]