[ 
https://issues.apache.org/jira/browse/YARN-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13864237#comment-13864237
 ] 

Steve Loughran commented on YARN-1490:
--------------------------------------

How will the AM get notified of its existing containers? I can't seem to see 
this in the code.

I can see the AM needing to know the following
# that it has been restarted with containers retained
# the list of the container allocations {{List<Container> liveContainers}}.
# the list of containers that failed during the outage. {{List<Container> 
completedContainers}}.

>From that I can rebuild my model of the world (using container priorities to 
>map to allocated roles)


> RM should optionally not kill all containers when an ApplicationMaster exits
> ----------------------------------------------------------------------------
>
>                 Key: YARN-1490
>                 URL: https://issues.apache.org/jira/browse/YARN-1490
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Vinod Kumar Vavilapalli
>            Assignee: Jian He
>         Attachments: YARN-1490.1.patch, YARN-1490.2.patch, YARN-1490.3.patch
>
>
> This is needed to enable work-preserving AM restart. Some apps can chose to 
> reconnect with old running containers, some may not want to. This should be 
> an option.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to