[
https://issues.apache.org/jira/browse/YARN-965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arun C Murthy updated YARN-965:
-------------------------------
Fix Version/s: (was: 2.4.0)
2.5.0
> NodeManager Metrics containersRunning is not correct When localizing
> container process is failed or killed
> ----------------------------------------------------------------------------------------------------------
>
> Key: YARN-965
> URL: https://issues.apache.org/jira/browse/YARN-965
> Project: Hadoop YARN
> Issue Type: Bug
> Components: nodemanager
> Affects Versions: 2.0.4-alpha
> Environment: suse linux
> Reporter: Li Yuan
> Fix For: 2.5.0
>
>
> When successfully launched a container, container state from LOCALIZED to
> RUNNING, containersRunning ++. Container state from EXITED_WITH_FAILURE or
> KILLING to DONE, containersRunning--.
> However, state EXITED_WITH_FAILURE or KILLING could come from
> LOCALIZING(LOCALIZED), not RUNNING, which caused containersRunningis less
> than the actual number. Further more, Metrics is wrong, containersLaunched !=
> containersCompleted + containersFailed + containersKilled + containersRunning
> + containersIniting
--
This message was sent by Atlassian JIRA
(v6.2#6252)