[ https://issues.apache.org/jira/browse/YARN-4408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15037987#comment-15037987 ]
Hudson commented on YARN-4408: ------------------------------ FAILURE: Integrated in Hadoop-Hdfs-trunk-Java8 #661 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Java8/661/]) YARN-4408. Fix issue that NodeManager still reports negative running (junping_du: rev 62e9348bc10bb97a5fcb4281f7996a09d8e69c60) * hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/container/ContainerImpl.java * hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/container/TestContainer.java * hadoop-yarn-project/CHANGES.txt > NodeManager still reports negative running containers > ----------------------------------------------------- > > Key: YARN-4408 > URL: https://issues.apache.org/jira/browse/YARN-4408 > Project: Hadoop YARN > Issue Type: Bug > Components: nodemanager > Affects Versions: 2.4.0 > Reporter: Robert Kanter > Assignee: Robert Kanter > Fix For: 2.8.0 > > Attachments: YARN-4408.001.patch, YARN-4408.002.patch, > YARN-4408.003.patch > > > YARN-1697 fixed a problem where the NodeManager metrics could report a > negative number of running containers. However, it missed a rare case where > this can still happen. > YARN-1697 added a flag to indicate if the container was actually launched > ({{LOCALIZED}} to {{RUNNING}}) or not ({{LOCALIZED}} to {{KILLING}}), which > is then checked when transitioning from {{CONTAINER_CLEANEDUP_AFTER_KILL}} to > {{DONE}} and {{EXITED_WITH_FAILURE}} to {{DONE}} to only decrement the gauge > if we actually ran the container and incremented the gauge . However, this > flag is not checked while transitioning from {{EXITED_WITH_SUCCESS}} to > {{DONE}}. -- This message was sent by Atlassian JIRA (v6.3.4#6332)