Jason Lowe created YARN-5451:
--------------------------------
Summary: Container localizers that hang are not cleaned up
Key: YARN-5451
URL: https://issues.apache.org/jira/browse/YARN-5451
Project: Hadoop YARN
Issue Type: Bug
Components: nodemanager
Affects Versions: 2.6.0
Reporter: Jason Lowe
I ran across an old, rogue process on one of our nodes. It apparently was a
container localizer that somehow entered an infinite loop during startup. The
NM never cleaned up this broken localizer, so it happily ran forever. The NM
needs to do a better job of tracking localizers, including killing them if they
appear to be hung/broken.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]