[
https://issues.apache.org/jira/browse/HDFS-6469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14111655#comment-14111655
]
Sanjay Radia commented on HDFS-6469:
------------------------------------
My thoughts:
* I do believe that Paxos based NN would give faster failover than what NN HA
offers today (30sec to a few minutes but typically no more than 1 minute or
two). So this is clearly a benefit of CNode though I have not heard a single
customer complain about the failover time so far.
* The proposed solution does not increase the write throughput.
* The parallel reads advantage of CNode can be achieved in the current HA
setup with some work (this is discussed above). If this is the main benefit
than I rather pursue enhancing the NN standby to support reads. Further there
is existing on going work to improve the locking in the NN.
* I share Todd's view that ZK is not a usable "reference implementation" for
Paxos. One really needs a paxos library that can be plugged in rather than an
external server-based solution like ZK.
So at this stage I am having a hard time seeing the benefits to justify the
costs of adding this complexity. I do however understand the overhead that
Wandisco faces in integrating their solution with HDFS each time HDFS is
modified. Would a few plugin interfaces make it easier? I would be more than
happy to support adding such plugins if they would help.
> Coordinated replication of the namespace using ConsensusNode
> ------------------------------------------------------------
>
> Key: HDFS-6469
> URL: https://issues.apache.org/jira/browse/HDFS-6469
> Project: Hadoop HDFS
> Issue Type: New Feature
> Components: namenode
> Affects Versions: 3.0.0
> Reporter: Konstantin Shvachko
> Assignee: Konstantin Shvachko
> Attachments: CNodeDesign.pdf
>
>
> This is a proposal to introduce ConsensusNode - an evolution of the NameNode,
> which enables replication of the namespace on multiple nodes of an HDFS
> cluster by means of a Coordination Engine.
--
This message was sent by Atlassian JIRA
(v6.2#6252)