Junping Du created YARN-4274: -------------------------------- Summary: NodeStatusUpdaterImpl should register to RM again after a non-fatal exception happen before Key: YARN-4274 URL: https://issues.apache.org/jira/browse/YARN-4274 Project: Hadoop YARN Issue Type: Improvement Reporter: Junping Du Assignee: Junping Du
>From YARN-3896, an non-fatal exception like response ID mismatch between NM >and RM (due to a race condition) will cause NM stop working. I think we should >make it more robust to tolerant a few times failure in registering to RM. -- This message was sent by Atlassian JIRA (v6.3.4#6332)