It is possible to roll in additional nodes into the cluster anytime you
want. Not much complexity in that.

However, existing 0.14 hadoop release will not rebalance data across these
new nodes. What that means is that the new nodes will be relatively empty
till new data arrives into the cluster. It might take a while for the new
nodes to get filled up.

Work is in progress to facilitate cluster-data rebalance when new Datanodes
are added.
One important goal of hadoop is the ability to grow a cluster over time.

Thanks,
dhruba

-----Original Message-----
From: C G [mailto:[EMAIL PROTECTED] 
Sent: Friday, August 03, 2007 3:17 AM
To: hadoop-user@lucene.apache.org
Subject: HDFS Question re adding additional storage

Is it possible to additional space to HDFS (in the form of new datanodes)
with minimal/no fuss?  In other words, if I have 8T across 16 machines, and
I want to go to 16T across 32 machines, can I roll in new machines easily,
or do I need to plan considerable downtime to rebuild things and move data
around?
   
  There are obvious implications here for how big an initial system to
build, and the costs associated with buying now and buying later.
   
  Thanks,
  C G
   

       
---------------------------------
Got a little couch potato? 
Check out fun summer activities for kids.
       
---------------------------------
Got a little couch potato? 
Check out fun summer activities for kids.

Reply via email to