[ 
https://issues.apache.org/jira/browse/HDFS-6688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14062875#comment-14062875
 ] 

Andrew Wang commented on HDFS-6688:
-----------------------------------

I admit we don't have great (any?) documentation for metrics; typically I have 
to dig around in the code. I don't think the stale node feature has much in the 
way of docs either, so if you see a good place to add a mention, we'd happily 
take a docs patch.

The 10.5 minutes before a datanode is considered dead is also somewhat 
configurable. It's calculated from this in DatanodeManager:

{code}
    this.heartbeatExpireInterval = 2 * heartbeatRecheckInterval
        + 10 * 1000 * heartbeatIntervalSeconds;
{code}
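
As a rough worked example of the formula above, here is a minimal standalone 
sketch. The class HeartbeatExpiry and its method are hypothetical and not part 
of Hadoop. It assumes the recheck interval is given in milliseconds and the 
heartbeat interval in seconds (as the multiplication by 1000 suggests), and it 
assumes stock values of 300000 ms and 3 s, which should be checked against 
your own hdfs-site.xml:

```java
import java.util.Locale;

// Hypothetical helper, not part of Hadoop: mirrors the DatanodeManager formula.
final class HeartbeatExpiry {
    private HeartbeatExpiry() {}

    /**
     * @param recheckIntervalMs        dfs.namenode.heartbeat.recheck-interval, in milliseconds
     * @param heartbeatIntervalSeconds dfs.heartbeat.interval, in seconds
     * @return time in milliseconds without a heartbeat before a datanode is treated as dead
     */
    static long expireIntervalMs(long recheckIntervalMs, long heartbeatIntervalSeconds) {
        return 2 * recheckIntervalMs + 10 * 1000 * heartbeatIntervalSeconds;
    }

    public static void main(String[] args) {
        // Assumed stock settings: 2 * 300000 + 10 * 1000 * 3 = 630000 ms = 10.5 min.
        long defaults = expireIntervalMs(300_000L, 3L);
        // Illustrative tuned setting: 2 * 60000 + 10 * 1000 * 3 = 150000 ms = 2.5 min.
        long tuned = expireIntervalMs(60_000L, 3L);

        System.out.println(String.format(Locale.ROOT,
                "assumed defaults: %d ms (%.1f min)", defaults, defaults / 60_000.0));
        System.out.println(String.format(Locale.ROOT,
                "recheck = 60 s:   %d ms (%.1f min)", tuned, tuned / 60_000.0));
    }
}
```

The recheck interval is counted twice, so lowering it shortens the timeout 
much faster than lowering the heartbeat interval by the same proportion.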

So if you adjust "dfs.namenode.heartbeat.recheck-interval" (in milliseconds) 
and "dfs.heartbeat.interval" (in seconds), you have some control over that 
timeout.

> Hadoop JMX stats are not refreshed
> ----------------------------------
>
>                 Key: HDFS-6688
>                 URL: https://issues.apache.org/jira/browse/HDFS-6688
>             Project: Hadoop HDFS
>          Issue Type: Bug
>         Environment: Ubuntu
>            Reporter: Biju Nair
>
> Even when the HDFS datanode process is stopped, the values of the JMX 
> attributes Hadoop.NameNode.FSNamesystemState.NumLiveDataNodes/NumDeadDataNodes 
> don't change. Also, Hadoop.NameNode.NameNodeInfo.Attributes.LiveNodes still 
> shows the stopped datanode's details. If these attributes reflected the actual 
> state of the datanodes, they could be used to monitor the health of the HDFS 
> cluster, which currently isn't possible.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
