Rohith created YARN-1686:
----------------------------
Summary: NodeManager.resyncWithRM() does not handle exceptions,
which causes the NodeManager to hang.
Key: YARN-1686
URL: https://issues.apache.org/jira/browse/YARN-1686
Project: Hadoop YARN
Issue Type: Bug
Components: nodemanager
Affects Versions: 2.3.0
Reporter: Rohith
Assignee: Rohith
During NodeManager startup, if registration with the ResourceManager throws an
exception, the NodeManager shuts down.
Consider the case where NM-1 is registered with the RM and the RM issues a Resync
to the NM. "resyncWithRM" starts a new thread that does not handle exceptions,
so if any exception is thrown during the RESYNC event, that thread is lost and
the NodeManager enters a hung state.
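The failure mode can be illustrated with a minimal, standalone sketch. It does not use the actual NodeManager code: the class ResyncSketch, the Registration interface, and the shutdownRequested flag are hypothetical names chosen for illustration. It only shows one possible way for a resync thread to catch the exception and request a shutdown, mirroring the startup behaviour, instead of dying unnoticed.

```java
import java.util.concurrent.atomic.AtomicBoolean;

// Standalone illustration only; none of these names come from the YARN code base.
class ResyncSketch {

    /** Hypothetical stand-in for the "re-register with the RM" step. */
    interface Registration {
        void rebootAndRegister() throws Exception;
    }

    // Hypothetical flag standing in for "the NodeManager should shut down".
    private final AtomicBoolean shutdownRequested = new AtomicBoolean(false);

    /** Runs the resync step on a new thread and returns that thread. */
    public Thread resyncWithRM(Registration registration) {
        Thread t = new Thread(() -> {
            try {
                registration.rebootAndRegister();
            } catch (Throwable e) {
                // Without this catch the exception would end only this thread,
                // and the rest of the process would keep running unregistered.
                System.err.println("Resync failed, requesting shutdown: " + e);
                shutdownRequested.set(true);
            }
        }, "resync-sketch");
        t.start();
        return t;
    }

    public boolean isShutdownRequested() {
        return shutdownRequested.get();
    }

    public static void main(String[] args) throws InterruptedException {
        ResyncSketch nm = new ResyncSketch();
        // Simulate a registration failure during RESYNC.
        Thread t = nm.resyncWithRM(() -> {
            throw new IllegalStateException("RM unreachable");
        });
        t.join();
        System.out.println("shutdown requested: " + nm.isShutdownRequested());
    }
}
```

With the simulated failure, the sketch prints "shutdown requested: true". Whether the real fix should shut the NodeManager down or retry the registration is a design decision for the patch; the sketch only demonstrates that the exception has to be handled inside the spawned thread.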
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)