Thank you very much Todd. I hope futute versions of hadoop rebalcer will include this check.
I have one more question. If we are in the process of setting up additional nodes incrementally in different rack (say rack-2) and rack-2 size is only 25% of rack-1, how would data be balanced (with default implementation)? i.e Will hadoop prefers balancing the overall nodes or will it try to obey the topology first that could fillup rack-2 quickly?. I am positive that it will try to balance overall nodes but want to be sure. Thanks and Regards Ravi On Tue, Jan 17, 2012 at 10:41 AM, Todd Lipcon <[email protected]> wrote: > Hi Ravi, > > You'll probably need to up the replication level of the affected files > and then drop it back down to the desired level. Current versions of > HDFS do not automatically repair rack policy violations if they're > introduced in this manner. > > -Todd > > On Mon, Jan 16, 2012 at 3:53 PM, rk vishu <[email protected]> wrote: > > Hello All, > > > > If i change the rackid for some nodes and restart namenode, will data be > > rearranged accordingly? Do i need to run rebalancer? > > > > Any information on this would be appreciated. > > > > Thanks and Regards > > Ravi > > > > -- > Todd Lipcon > Software Engineer, Cloudera >
