[ https://issues.apache.org/jira/browse/YARN-965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arun C Murthy updated YARN-965:
-------------------------------

    Fix Version/s:     (was: 2.1.0-beta)
                   2.3.0

> NodeManager Metrics containersRunning is not correct When localizing container process is failed or killed
> ----------------------------------------------------------------------------------------------------------
>
>                 Key: YARN-965
>                 URL: https://issues.apache.org/jira/browse/YARN-965
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 2.0.4-alpha
>         Environment: suse linux
>            Reporter: Li Yuan
>             Fix For: 2.3.0
>
>
> When a container is launched successfully, its state moves from LOCALIZED to
> RUNNING and containersRunning is incremented. When its state moves from
> EXITED_WITH_FAILURE or KILLING to DONE, containersRunning is decremented.
> However, EXITED_WITH_FAILURE or KILLING can also be reached from
> LOCALIZING (or LOCALIZED), not only from RUNNING. In that case the counter is
> decremented without ever having been incremented, so containersRunning is
> lower than the actual number. Furthermore, the metrics no longer balance:
> containersLaunched != containersCompleted + containersFailed + containersKilled
> + containersRunning + containersIniting
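
The counting problem described in the report can be illustrated with a small, self-contained sketch. This is a simplified model, not NodeManager code and not the patch for this issue: the class `ContainersRunningSketch`, its `State` enum, and both methods are hypothetical names introduced only for illustration. The `wasRunning` guard is just one possible way to avoid the spurious decrement, assumed here for the sake of the example.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical, simplified model of the counter described in the report.
// None of these names come from the NodeManager source.
class ContainersRunningSketch {
    enum State { LOCALIZING, LOCALIZED, RUNNING, EXITED_WITH_FAILURE, KILLING, DONE }

    // Behaviour as reported: increment on LOCALIZED -> RUNNING, decrement on
    // any EXITED_WITH_FAILURE/KILLING -> DONE, regardless of earlier states.
    static int buggyRunning(List<State> path) {
        int running = 0;
        for (int i = 1; i < path.size(); i++) {
            State from = path.get(i - 1);
            State to = path.get(i);
            if (from == State.LOCALIZED && to == State.RUNNING) {
                running++;
            }
            if ((from == State.EXITED_WITH_FAILURE || from == State.KILLING)
                    && to == State.DONE) {
                running--;
            }
        }
        return running;
    }

    // One possible guard (an assumption, not the actual fix): only decrement
    // if this container was counted as running in the first place.
    static int guardedRunning(List<State> path) {
        int running = 0;
        boolean wasRunning = false;
        for (int i = 1; i < path.size(); i++) {
            State from = path.get(i - 1);
            State to = path.get(i);
            if (from == State.LOCALIZED && to == State.RUNNING) {
                running++;
                wasRunning = true;
            }
            if ((from == State.EXITED_WITH_FAILURE || from == State.KILLING)
                    && to == State.DONE && wasRunning) {
                running--;
            }
        }
        return running;
    }

    public static void main(String[] args) {
        // A container killed while still localizing never reaches RUNNING.
        List<State> killedWhileLocalizing =
                Arrays.asList(State.LOCALIZING, State.KILLING, State.DONE);
        System.out.println(buggyRunning(killedWhileLocalizing));   // prints -1
        System.out.println(guardedRunning(killedWhileLocalizing)); // prints 0
    }
}
```

For a container that does reach RUNNING before being killed, both methods return 0, so the guard only changes the result for containers that fail or are killed during localization.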