Bikas Saha created YARN-1367:
--------------------------------
Summary: After restart NM should resync with the RM without
killing containers
Key: YARN-1367
URL: https://issues.apache.org/jira/browse/YARN-1367
Project: Hadoop YARN
Issue Type: Sub-task
Reporter: Bikas Saha
After an RM restart, the RM sends a resync response to every NM that heartbeats
to it. Upon receiving the resync response, the NM currently kills all of its
containers and re-registers with the RM. The NM should be changed so that it
does not kill the containers and instead informs the RM about all currently
running containers, including their allocations etc. After re-registering, the
NM should send all pending container completions to the RM as usual.
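
The proposed flow could be sketched roughly as below. This is only an
illustration of the intended behaviour, not the actual NodeManager code: every
class and method name (ResyncSketch, FakeNodeManager, FakeResourceManager,
onResync, and so on) is hypothetical, the container ids are made up, and the
"allocation" is reduced to a single memory figure as an assumption.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch; none of these types exist in YARN.
final class ResyncSketch {

    /** Stand-in for the RM: it only records what the NM tells it. */
    static final class FakeResourceManager {
        // container id -> allocated memory in MB (assumed shape of an "allocation")
        final Map<String, Integer> registeredRunning = new LinkedHashMap<>();
        final List<String> completedSeen = new ArrayList<>();

        void register(Map<String, Integer> runningContainers) {
            registeredRunning.clear();
            registeredRunning.putAll(runningContainers);
        }

        void heartbeat(List<String> completedContainerIds) {
            completedSeen.addAll(completedContainerIds);
        }
    }

    /** Stand-in for the NM: tracks running containers and unsent completions. */
    static final class FakeNodeManager {
        private final Map<String, Integer> running = new LinkedHashMap<>();
        private final List<String> pendingCompletions = new ArrayList<>();

        void launch(String containerId, int memoryMb) {
            running.put(containerId, memoryMb);
        }

        void complete(String containerId) {
            // Only containers that were actually running produce a completion.
            if (running.remove(containerId) != null) {
                pendingCompletions.add(containerId);
            }
        }

        /** Proposed handling of a resync response from the RM. */
        void onResync(FakeResourceManager rm) {
            // 1. Do NOT kill containers; leave 'running' untouched.
            // 2. Re-register, reporting every running container and its allocation.
            rm.register(new LinkedHashMap<>(running));
            // 3. Send pending completions as usual, then forget them.
            rm.heartbeat(new ArrayList<>(pendingCompletions));
            pendingCompletions.clear();
        }

        int runningCount() {
            return running.size();
        }
    }

    public static void main(String[] args) {
        FakeResourceManager rm = new FakeResourceManager();
        FakeNodeManager nm = new FakeNodeManager();
        nm.launch("container_1", 1024);
        nm.launch("container_2", 2048);
        nm.complete("container_1");

        nm.onResync(rm); // the restarted RM asked for a resync

        System.out.println("still running on NM: " + nm.runningCount());
        System.out.println("reported to RM:      " + rm.registeredRunning);
        System.out.println("completions sent:    " + rm.completedSeen);
    }
}
```

The point of the sketch is the ordering inside onResync: the set of running
containers survives the resync, the RM learns about them at registration time,
and completions that were pending before the resync are still delivered.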
--
This message was sent by Atlassian JIRA
(v6.1#6144)