You can see Rebalancer of Hadoop at:
http://hadoop.apache.org/core/docs/r0.18.0/hdfs_user_guide.html#Rebalancer

2008/9/9 [EMAIL PROTECTED] <[EMAIL PROTECTED]>

>
> Does Hadoop distribute blocks according to how many blocks a node currently
> contains or according to how much disk space the node has remaining
> currently ?
> Suppose that I have many machines with identical CPUs but different disk
> sizes. If the blocks get distributed according to the remaining disk space,
> then the larger disk nodes would be storing more data... would this cause
> performance problems during the mapping phase ?
> Thanks,
> moonwatcher
>
>
>
>
>




-- 
Sorry for my english!! 明

Reply via email to