[ https://issues.apache.org/jira/browse/YARN-149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13806917#comment-13806917 ]
Zhijie Shen commented on YARN-149:
----------------------------------
Is this a random test failure related to some ZKRMStateStore patch?
https://builds.apache.org/job/PreCommit-YARN-Build/2291//testReport/org.apache.hadoop.yarn.server.resourcemanager.recovery/TestZKRMStateStoreZKClientConnections/testZKClientDisconnectAndReconnect/
> ResourceManager (RM) High-Availability (HA)
> -------------------------------------------
>
> Key: YARN-149
> URL: https://issues.apache.org/jira/browse/YARN-149
> Project: Hadoop YARN
> Issue Type: New Feature
> Reporter: Harsh J
> Assignee: Bikas Saha
> Attachments: rm-ha-phase1-approach-draft1.pdf,
> rm-ha-phase1-draft2.pdf,
> YARN ResourceManager Automatic Failover-rev-07-21-13.pdf,
> YARN ResourceManager Automatic Failover-rev-08-04-13.pdf
>
>
> This JIRA tracks the work needed to support failover from one RM instance
> to another, so that we can have RM HA. The work includes leader election,
> transfer of control to the leader, and client redirection to the new leader.