[ https://issues.apache.org/jira/browse/YARN-1071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13908179#comment-13908179 ]

Hudson commented on YARN-1071:
------------------------------

FAILURE: Integrated in Hadoop-Yarn-trunk #488 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/488/])
YARN-1071. Enabled ResourceManager to recover cluster metrics numDecommissionedNMs after restarting. Contributed by Jian He. (zjshen: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1570469)
* /hadoop/common/trunk/hadoop-yarn-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/ClusterMetrics.java
* /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/NodesListManager.java
* /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmnode/RMNodeImpl.java
* /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestRMNodeTransitions.java
* /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestRMRestart.java
* /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestResourceTrackerService.java


> ResourceManager's decommissioned and lost node count is 0 after restart
> -----------------------------------------------------------------------
>
>                 Key: YARN-1071
>                 URL: https://issues.apache.org/jira/browse/YARN-1071
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.1.0-beta
>            Reporter: Srimanth Gunturi
>            Assignee: Jian He
>             Fix For: 2.4.0
>
>         Attachments: YARN-1071.1.patch, YARN-1071.2.patch, YARN-1071.3.patch, YARN-1071.4.patch, YARN-1071.5.patch, YARN-1071.6.patch
>
>
> I had 6 nodes in a cluster with 2 NMs stopped. Then I put a host into YARN's {{yarn.resourcemanager.nodes.exclude-path}}. After running {{yarn rmadmin -refreshNodes}}, RM's JMX correctly showed decommissioned node count:
> {noformat}
> "NumActiveNMs" : 3,
> "NumDecommissionedNMs" : 1,
> "NumLostNMs" : 2,
> "NumUnhealthyNMs" : 0,
> "NumRebootedNMs" : 0
> {noformat}
> After restarting RM, the counts were shown as below in JMX.
> {noformat}
> "NumActiveNMs" : 3,
> "NumDecommissionedNMs" : 0,
> "NumLostNMs" : 0,
> "NumUnhealthyNMs" : 0,
> "NumRebootedNMs" : 0
> {noformat}
> Notice that the lost and decommissioned NM counts are both 0.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)