[ 
https://issues.apache.org/jira/browse/YARN-2674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16432589#comment-16432589
 ] 

Shane Kumpf commented on YARN-2674:
-----------------------------------

[~chenchun] - Thanks for the patch here. We are seeing this when testing the 
Docker runtime and it results in extra Docker containers being launched on RM 
restart, which is problematic. I've validated that the logic in this patch 
resolves that issue. Any chance you'd be able to update the patch? If you don't 
have the time, I could put up a patch based on your previous patch.

> Distributed shell AM may re-launch containers if RM work preserving restart 
> happens
> -----------------------------------------------------------------------------------
>
>                 Key: YARN-2674
>                 URL: https://issues.apache.org/jira/browse/YARN-2674
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: applications, resourcemanager
>            Reporter: Chun Chen
>            Assignee: Chun Chen
>            Priority: Major
>              Labels: oct16-easy
>         Attachments: YARN-2674.1.patch, YARN-2674.2.patch, YARN-2674.3.patch, 
> YARN-2674.4.patch, YARN-2674.5.patch
>
>
> Currently, if RM work preserving restart happens while distributed shell is 
> running, distribute shell AM may re-launch all the containers, including 
> new/running/complete. We must make sure it won't re-launch the 
> running/complete containers.
> We need to remove allocated containers from 
> AMRMClientImpl#remoteRequestsTable once AM receive them from RM. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to