Li Yuan created YARN-965:
----------------------------
Summary: NodeManager metric containersRunning is incorrect when a
container fails or is killed during localization
Key: YARN-965
URL: https://issues.apache.org/jira/browse/YARN-965
Project: Hadoop YARN
Issue Type: Bug
Components: nodemanager
Affects Versions: 2.0.4-alpha
Environment: suse linux
Reporter: Li Yuan
Fix For: 2.1.0-beta
When a container is launched successfully, its state changes from LOCALIZED to
RUNNING and containersRunning is incremented. When the container state changes
from EXITED_WITH_FAILURE or KILLING to DONE, containersRunning is decremented.
However, a container can reach EXITED_WITH_FAILURE or KILLING from LOCALIZING
or LOCALIZED, not only from RUNNING. In that case containersRunning is
decremented without ever having been incremented, so it ends up lower than the
actual number of running containers. Furthermore, the metrics become
inconsistent: containersLaunched != containersCompleted + containersFailed +
containersKilled + containersRunning + containersIniting
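
The accounting problem can be reproduced with a small standalone sketch. This
is not the NodeManager code: the class ContainerMetricsSketch, the method
runningDelta and the onlyIfWasRunning flag are hypothetical names used for
illustration, and the guard shown is just one possible way to avoid the
spurious decrement, not necessarily the fix that was committed.

```java
public class ContainerMetricsSketch {
    // Subset of container states mentioned in this report.
    enum State { LOCALIZING, LOCALIZED, RUNNING, EXITED_WITH_FAILURE, KILLING, DONE }

    /**
     * Replays one container's state path and returns the net change applied
     * to containersRunning. With onlyIfWasRunning == false the decrement is
     * unconditional (the behaviour described in the report); with true, the
     * decrement happens only if the container actually reached RUNNING.
     */
    static int runningDelta(State[] path, boolean onlyIfWasRunning) {
        int containersRunning = 0;
        boolean wasRunning = false;
        for (int i = 1; i < path.length; i++) {
            State from = path[i - 1];
            State to = path[i];
            // LOCALIZED -> RUNNING: containersRunning++
            if (from == State.LOCALIZED && to == State.RUNNING) {
                containersRunning++;
                wasRunning = true;
            }
            // EXITED_WITH_FAILURE or KILLING -> DONE: containersRunning--
            boolean finishing =
                (from == State.EXITED_WITH_FAILURE || from == State.KILLING)
                    && to == State.DONE;
            if (finishing && (!onlyIfWasRunning || wasRunning)) {
                containersRunning--;
            }
        }
        return containersRunning;
    }

    public static void main(String[] args) {
        // Container killed during localization: it never reached RUNNING.
        State[] killedWhileLocalizing = {
            State.LOCALIZING, State.KILLING, State.DONE
        };
        // Container that ran and was then killed.
        State[] killedWhileRunning = {
            State.LOCALIZING, State.LOCALIZED, State.RUNNING,
            State.KILLING, State.DONE
        };
        System.out.println(runningDelta(killedWhileLocalizing, false)); // -1
        System.out.println(runningDelta(killedWhileLocalizing, true));  // 0
        System.out.println(runningDelta(killedWhileRunning, false));    // 0
        System.out.println(runningDelta(killedWhileRunning, true));     // 0
    }
}
```

With the unconditional decrement, every container killed or failed during
localization pushes containersRunning one below the true value, which is why
the sum on the right-hand side above no longer matches containersLaunched.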