You can roll additional nodes into the cluster at any time. There is not much complexity in that.
However, the existing Hadoop 0.14 release will not rebalance data across these new nodes. That means the new nodes will stay relatively empty until new data arrives in the cluster, and it might take a while for them to fill up. Work is in progress to support rebalancing cluster data when new Datanodes are added. One important goal of Hadoop is the ability to grow a cluster over time.

Thanks,
dhruba

-----Original Message-----
From: C G [mailto:[EMAIL PROTECTED]
Sent: Friday, August 03, 2007 3:17 AM
To: hadoop-user@lucene.apache.org
Subject: HDFS Question re adding additional storage

Is it possible to add additional space to HDFS (in the form of new datanodes) with minimal/no fuss? In other words, if I have 8T across 16 machines, and I want to go to 16T across 32 machines, can I roll in new machines easily, or do I need to plan considerable downtime to rebuild things and move data around?

There are obvious implications here for how big an initial system to build, and the costs associated with buying now and buying later.

Thanks,
C G
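
To put rough numbers on the "new nodes stay relatively empty" point, here is a back-of-the-envelope sketch. It is not Hadoop code and does not model real block placement. The function name, the 4 TB of already-stored data, and the assumption that new writes spread evenly over all nodes are made up for illustration, and replication is ignored. The 16 + 16 machines at 0.5 TB each come from the 8T/16T figures in the question.

```python
def utilization_after_growth(old_nodes, new_nodes, capacity_tb, stored_tb, incoming_tb):
    """Estimate per-node fill fractions when the cluster is NOT rebalanced.

    Assumptions (illustrative only, not actual HDFS placement policy):
      * data already stored stays on the old nodes, spread evenly among them
      * incoming data is spread evenly across all nodes, old and new
      * every node has the same capacity; replication is ignored
    Returns (old_node_fill, new_node_fill) as fractions of node capacity.
    """
    total_nodes = old_nodes + new_nodes
    incoming_per_node = incoming_tb / total_nodes
    old_fill = (stored_tb / old_nodes + incoming_per_node) / capacity_tb
    new_fill = incoming_per_node / capacity_tb
    return old_fill, new_fill


if __name__ == "__main__":
    # 16 existing machines plus 16 new ones, 0.5 TB each (8T -> 16T).
    # Hypothetical: 4 TB already stored before the new machines arrive.
    for incoming in (0.0, 4.0):
        old_fill, new_fill = utilization_after_growth(16, 16, 0.5, 4.0, incoming)
        print(f"incoming={incoming} TB: old nodes {old_fill:.0%}, new nodes {new_fill:.0%}")
```

Under these assumptions the old nodes start half full and the new ones empty, and the gap only narrows in relative terms as new data is written, which is why a rebalancing facility matters for growing a cluster.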