[ 
https://issues.apache.org/jira/browse/HDFS-1623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008607#comment-13008607
 ] 

Suresh Srinivas commented on HDFS-1623:
---------------------------------------

> with the only difference being whether an active switch is enabled.
Yes. Active/standby will be the state of the namenode and code should be 
identical.

> It would be good if the code for active/standby detection was pluggable. So 
> that different options for failover could be provided. It wouldn't be good to 
> require that a zookeeper ensemble be set up just to run a namenode.
The document does not state zookeeper is needed. HA solution requires a quorum 
service, that should be pluggable. ZK is one of the options.

> How does heartbeat deal with network partition? My understanding of it is 
> that it sends packets at intervals to the other node, and if they don't get 
> through it considers the other dead. This could create a situation where both 
> active and standby think that the other is dead, and both become active, 
> leading to divergent filesystem states on each machine.
This is discussed in the document as split brain and fencing requirements right?

> Also, the design indicates that more than 2 NN is out of scope. Why? Surely 
> it's as easy to design for N namenodes as it is for 2 namenodes.
Why do you need more than 2 NNs? Having more than 2NNs could solve need for 
outside quorum service. But number of NNs could be huge, especially in 
federated clusters.

> If you want manual failover, from the server perspective you need to do 
> nothing. Operators can have 2 namenode machines, with the namenode only 
> running on one, writing to shared storage. When the want to failover to the 
> standby they just have to ensure that the active is down and start the 
> namenode daemon on the standby.
Not sure what you are getting at here, in reference to the attached document?

> I proposed a design last week for streaming updates from an active to a 
> standby, it may be interesting to you (ZOOKEEPER-1016). It does have some 
> mentions of active/standby detection, which I should remove. It occurs to me 
> now, that this functionallity should be separated out completely from the 
> WALing and should live at the level of NameNode.java.
I do not understand this point. Will take a look at your proposal. But as 
regards to this jira, BookKeeper could be a component in the solution and not 
the only component.


> High Availability Framework for HDFS NN
> ---------------------------------------
>
>                 Key: HDFS-1623
>                 URL: https://issues.apache.org/jira/browse/HDFS-1623
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: Sanjay Radia
>            Assignee: Sanjay Radia
>         Attachments: Namenode HA Framework.pdf
>
>


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to