Hi,

The rebalancer should help you: http://issues.apache.org/jira/browse/HADOOP-1652
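
A rough sketch (plain Python, not part of Hadoop) of why the balancer applies to this cluster. It follows the criterion given in the HDFS balancer documentation: a datanode counts as balanced when its utilization (used / capacity) is within a threshold of the cluster-wide utilization. The node names and the "used" figures are assumptions for illustration only; the capacities are the ones from the question, taking 2TB as 2000G. The 10% threshold is, as far as I know, the balancer's default.

```python
# Illustrative only: node names and "used" values are hypothetical.
# Capacities (in GB) are taken from the question in this thread.
capacity = {"small1": 100, "small2": 100, "small3": 100, "small4": 100, "big": 2000}

# Assumed state: the small nodes are full, the large node is empty.
used = {"small1": 100, "small2": 100, "small3": 100, "small4": 100, "big": 0}


def cluster_utilization(capacity, used):
    """Cluster-wide utilization in percent: total used / total capacity."""
    return 100.0 * sum(used.values()) / sum(capacity.values())


def classify(capacity, used, threshold=10.0):
    """Label each node relative to the cluster average, within +/- threshold."""
    avg = cluster_utilization(capacity, used)
    labels = {}
    for node, cap in capacity.items():
        util = 100.0 * used[node] / cap
        if util > avg + threshold:
            labels[node] = "over"      # candidate source of blocks
        elif util < avg - threshold:
            labels[node] = "under"     # candidate target for blocks
        else:
            labels[node] = "balanced"
    return labels


if __name__ == "__main__":
    print(round(cluster_utilization(capacity, used), 1))
    print(classify(capacity, used))
```

With these assumed numbers the cluster-wide utilization is about 16.7%, so the four small nodes are classified as over-utilized and the large node as under-utilized, which is the situation the balancer is meant to correct by moving blocks from the former to the latter. It is normally started with bin/start-balancer.sh, optionally with -threshold <percent>.
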
Amogh

On 10/28/09 2:54 PM, "Vibhooti Verma" wrote:

Hi All,

We are facing an issue with the distribution of data in a cluster whose nodes have different storage capacities. We have 4 nodes with 100G capacity each and 1 node with 2TB capacity. The storage on the high-capacity node is not being utilized, whereas all the low-capacity nodes are filling up.

Any help or suggestions in this regard would be appreciated.

--
cheers,
Vibhooti
