Thank you very much Todd. I hope futute versions of hadoop rebalcer will
include this check.

I have one more question.

If we are in the process of setting up additional nodes incrementally in
different rack (say rack-2) and rack-2 size is only 25% of rack-1, how
would data be balanced (with default implementation)?
i.e Will hadoop prefers balancing the overall nodes or will it try to obey
the topology first that could fillup rack-2 quickly?.  I am positive that
it will try to balance overall nodes but want to be sure.

Thanks and Regards
Ravi
On Tue, Jan 17, 2012 at 10:41 AM, Todd Lipcon <[email protected]> wrote:

> Hi Ravi,
>
> You'll probably need to up the replication level of the affected files
> and then drop it back down to the desired level. Current versions of
> HDFS do not automatically repair rack policy violations if they're
> introduced in this manner.
>
> -Todd
>
> On Mon, Jan 16, 2012 at 3:53 PM, rk vishu <[email protected]> wrote:
> > Hello All,
> >
> > If i change the rackid for some nodes and restart namenode, will data be
> > rearranged accordingly? Do i need to run rebalancer?
> >
> > Any information on this would be appreciated.
> >
> > Thanks and Regards
> > Ravi
>
>
>
> --
> Todd Lipcon
> Software Engineer, Cloudera
>

Reply via email to