[
https://issues.apache.org/jira/browse/YARN-1815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308937#comment-15308937
]
Subru Krishnan commented on YARN-1815:
--------------------------------------
The test failures seem unrelated as verified that all of them pass locally.
> Work preserving recovery of Unmanged AMs
> ----------------------------------------
>
> Key: YARN-1815
> URL: https://issues.apache.org/jira/browse/YARN-1815
> Project: Hadoop YARN
> Issue Type: Sub-task
> Components: resourcemanager
> Affects Versions: 2.3.0
> Reporter: Karthik Kambatla
> Assignee: Subru Krishnan
> Priority: Critical
> Attachments: Unmanaged AM recovery.png, YARN-1815-v3.patch,
> YARN-1815-v4.patch, yarn-1815-1.patch, yarn-1815-2.patch, yarn-1815-2.patch
>
>
> Currently work preserving RM restart recovers unmanaged AMs but it has a
> couple of shortcomings - all running containers are killed and completed
> unmanaged AMs are also recovered as we do _not_ record final state for
> unmanaged AMs in the RM StateStore. This JIRA proposes to address both the
> shortcomings so that work preserving unmanaged AM recovery works exactly like
> with managed AMs
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]