[ https://issues.apache.org/jira/browse/YARN-965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Arun C Murthy updated YARN-965: ------------------------------- Fix Version/s: (was: 2.6.0) 2.7.0 > NodeManager Metrics containersRunning is not correct When localizing > container process is failed or killed > ---------------------------------------------------------------------------------------------------------- > > Key: YARN-965 > URL: https://issues.apache.org/jira/browse/YARN-965 > Project: Hadoop YARN > Issue Type: Bug > Components: nodemanager > Affects Versions: 2.0.4-alpha > Environment: suse linux > Reporter: Li Yuan > Fix For: 2.7.0 > > > When successfully launched a container, container state from LOCALIZED to > RUNNING, containersRunning ++. Container state from EXITED_WITH_FAILURE or > KILLING to DONE, containersRunning--. > However, state EXITED_WITH_FAILURE or KILLING could come from > LOCALIZING(LOCALIZED), not RUNNING, which caused containersRunningis less > than the actual number. Further more, Metrics is wrong, containersLaunched != > containersCompleted + containersFailed + containersKilled + containersRunning > + containersIniting -- This message was sent by Atlassian JIRA (v6.3.4#6332)