[
https://issues.apache.org/jira/browse/YARN-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13864237#comment-13864237
]
Steve Loughran commented on YARN-1490:
--------------------------------------
How will the AM get notified of its existing containers? I can't seem to see
this in the code.
I can see the AM needing to know the following
# that it has been restarted with containers retained
# the list of the container allocations {{List<Container> liveContainers}}.
# the list of containers that failed during the outage. {{List<Container>
completedContainers}}.
>From that I can rebuild my model of the world (using container priorities to
>map to allocated roles)
> RM should optionally not kill all containers when an ApplicationMaster exits
> ----------------------------------------------------------------------------
>
> Key: YARN-1490
> URL: https://issues.apache.org/jira/browse/YARN-1490
> Project: Hadoop YARN
> Issue Type: Sub-task
> Reporter: Vinod Kumar Vavilapalli
> Assignee: Jian He
> Attachments: YARN-1490.1.patch, YARN-1490.2.patch, YARN-1490.3.patch
>
>
> This is needed to enable work-preserving AM restart. Some apps can chose to
> reconnect with old running containers, some may not want to. This should be
> an option.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)