[ 
https://issues.apache.org/jira/browse/HDFS-2185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13246045#comment-13246045
 ] 

Todd Lipcon commented on HDFS-2185:
-----------------------------------

bq. I think you are missing the failure arc when transitionToStandby is called 
in InElection state.

This failure handling isn't necessary, since any failure to transition will be 
shortly followed by the health monitor entering a bad state. This is because a 
failure in state transition causes the NN to abort.

bq. Is there any scope for admin operations in ZKFC. Will ZKFC receive and 
accept a signal (manual admin/auto machine reboot) to stop services? At that 
point, in InElection state, how will it know that it needs to send 
transitionToStandby or not (based on whether it is active or not)?

I just added a section regarding manual failover operation in conjunction with 
automatic. I was hoping we could merge the automatic work back to trunk prior 
to adding this feature, though, treating it as an improvement.
                
> HA: HDFS portion of ZK-based FailoverController
> -----------------------------------------------
>
>                 Key: HDFS-2185
>                 URL: https://issues.apache.org/jira/browse/HDFS-2185
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: auto-failover, ha
>    Affects Versions: 0.24.0, 0.23.3
>            Reporter: Eli Collins
>            Assignee: Todd Lipcon
>             Fix For: Auto failover (HDFS-3042)
>
>         Attachments: Failover_Controller.jpg, hdfs-2185.txt, hdfs-2185.txt, 
> hdfs-2185.txt, hdfs-2185.txt, hdfs-2185.txt, zkfc-design.pdf, 
> zkfc-design.pdf, zkfc-design.pdf, zkfc-design.pdf, zkfc-design.tex
>
>
> This jira is for a ZK-based FailoverController daemon. The FailoverController 
> is a separate daemon from the NN that does the following:
> * Initiates leader election (via ZK) when necessary
> * Performs health monitoring (aka failure detection)
> * Performs fail-over (standby to active and active to standby transitions)
> * Heartbeats to ensure the liveness
> It should have the same/similar interface as the Linux HA RM to aid 
> pluggability.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to