[ 
https://issues.apache.org/jira/browse/YARN-1027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13728992#comment-13728992
 ] 

Bikas Saha commented on YARN-1027:
----------------------------------

RMHAServiceProtocol may be a wrapper that does what main() currently does. Upon 
transitionToActive() it instantiates and starts ResourceManager object and upon 
transitionToStandby() it stops the ResourceManager object. Potential issues may 
occur in the stopping/shutdown logic in the RM and how it interacts with the 
extra wrapping layer. Since RPC services are not up and listening, clients may 
see long timeouts when connecting to standby RM's.

Alternatively, RMHAServiceProtocol can be added as another service in the RM. 
All services may be started as they currently do. Modify RM  service startup 
logic to first start this service and maybe other RPC services. The 
RMHAServiceProtocol service starts internal services when it 
transitionToActive() and stops them when transitionToStandby().

Add haEnabled configuration which is false by default. When HA is off then RM 
is always started in active state immediately. 
                
> Implement RMHAServiceProtocol
> -----------------------------
>
>                 Key: YARN-1027
>                 URL: https://issues.apache.org/jira/browse/YARN-1027
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Bikas Saha
>
> Implement existing HAServiceProtocol from Hadoop common. This protocol is the 
> single point of interaction between the RM and HA clients/services.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to