Bikas Saha created YARN-1367:
--------------------------------

             Summary: After restart NM should resync with the RM without killing containers
                 Key: YARN-1367
                 URL: https://issues.apache.org/jira/browse/YARN-1367
             Project: Hadoop YARN
          Issue Type: Sub-task
            Reporter: Bikas Saha


After an RM restart, the RM sends a resync response to every NM that
heartbeats to it. Upon receiving the resync response, the NM currently kills
all of its containers and re-registers with the RM. The NM should be changed
to leave its containers running and instead inform the RM about all currently
running containers, including their allocations, etc. After re-registering,
the NM should send all pending container completions to the RM as usual.
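
To make the proposed behavior concrete, here is a rough sketch in Java. It is
not taken from the YARN code base: every class, field, and method name below
(ResyncSketch, NodeManagerSketch, RegisterRequest, onResync, nextHeartbeat) is
hypothetical, and the real NM/RM protocol records are not modeled. It only
illustrates the intended order: keep containers alive on resync, report the
running ones with their allocations when re-registering, then deliver pending
completions on the next heartbeat.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch; none of these names are actual YARN classes. */
public class ResyncSketch {

    enum State { RUNNING, COMPLETE }

    /** Minimal stand-in for a container together with its allocation. */
    static final class ContainerInfo {
        final String id;
        final int memoryMb;
        final int vcores;
        State state = State.RUNNING;

        ContainerInfo(String id, int memoryMb, int vcores) {
            this.id = id;
            this.memoryMb = memoryMb;
            this.vcores = vcores;
        }
    }

    /** Assumed shape of what the NM would send when it re-registers. */
    static final class RegisterRequest {
        final List<ContainerInfo> runningContainers;

        RegisterRequest(List<ContainerInfo> runningContainers) {
            this.runningContainers = runningContainers;
        }
    }

    static final class NodeManagerSketch {
        private final Map<String, ContainerInfo> containers = new LinkedHashMap<>();
        private final List<String> pendingCompletions = new ArrayList<>();

        void launch(String id, int memoryMb, int vcores) {
            containers.put(id, new ContainerInfo(id, memoryMb, vcores));
        }

        /** Mark a container finished and remember to tell the RM about it. */
        void complete(String id) {
            containers.get(id).state = State.COMPLETE;
            pendingCompletions.add(id);
        }

        /** On resync: kill nothing, just describe what is still running. */
        RegisterRequest onResync() {
            List<ContainerInfo> running = new ArrayList<>();
            for (ContainerInfo c : containers.values()) {
                if (c.state == State.RUNNING) {
                    running.add(c);
                }
            }
            return new RegisterRequest(running);
        }

        /** After re-registering: drain pending completions as usual. */
        List<String> nextHeartbeat() {
            List<String> completed = new ArrayList<>(pendingCompletions);
            pendingCompletions.clear();
            return completed;
        }

        int containerCount() {
            return containers.size();
        }
    }

    public static void main(String[] args) {
        NodeManagerSketch nm = new NodeManagerSketch();
        nm.launch("container_1", 1024, 1);
        nm.launch("container_2", 2048, 2);
        nm.complete("container_2");

        RegisterRequest request = nm.onResync();
        System.out.println("running reported: " + request.runningContainers.size());
        System.out.println("completions sent: " + nm.nextHeartbeat());
    }
}
```

In this sketch the completion queue is kept separate from the re-registration
payload, so a container that finished before the resync is reported once, on
the first heartbeat after re-registering, rather than being lost or listed as
running.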



--
This message was sent by Atlassian JIRA
(v6.1#6144)
