On Mar 12, 2011, at 8:43 AM, Allen Wittenauer wrote:
On Mar 3, 2011, at 2:41 PM, Suresh Srinivas wrote:
We have started pushing changes for namenode federation in to the
feature branch HDFS-1052. The work items are created as subtask of
the jira HDFS-1052 and are based on the design document published
in the same jira. By the end of this week, we will complete pushing
the changes to HDFS-1052 branch. Though the changes in these jiras
are already committed, please do provide your feedback on either
HDFS-1052 or its subtasks. New items that come out of the feedback
will be addressed in new jiras.
Current status of the development:
# The testing of this feature is underway. Most of the basic
functionality has been tested both for a single namenode cluster
(for backward compatibility) and with multiple namenodes.
# All the existing tests and newly added tests pass (same as trunk).
We plan on merging this branch to trunk after a week or two. This
will help us continue make future changes on the trunk. I will send
an announcement before merging the federation branch into trunk.
It sounds like merging into trunk is extremely premature. That
said, I'm still trying to understand the why's around this.
To me, this series of changes looks like it is going to make
running a grid much much harder for very little benefit. In
particular, I don't see the difference between running multiple NN/
DN combinations verses running federation, especially with client
side mount tables in play.
Main difference between independent HDFS clusters and HDFS federation
is that in federation one can shares the storage of the DNs and the DNs.
There is a very detailed document that describes this on the Jira.
If you are running a single NN and you don't need the scaling then
running and managing hadoop is for all practical purposes unchanged.
sanjay