See the Rebalancer section of the Hadoop HDFS User Guide: http://hadoop.apache.org/core/docs/r0.18.0/hdfs_user_guide.html#Rebalancer
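
As a rough illustration of the idea behind the balancer (my own sketch, not Hadoop's code): it compares each datanode's utilization (used space / capacity, as a percentage) with the utilization of the whole cluster, and treats the cluster as balanced when no node differs by more than a threshold (10% by default). So it works on percentage of disk used, not on block counts. The function name and the example node sizes below are made up for illustration.

```python
def classify_nodes(nodes, threshold_pct=10.0):
    """Classify datanodes against the cluster-wide utilization.

    nodes: dict mapping a node name to (used_bytes, capacity_bytes).
    threshold_pct: allowed deviation from cluster utilization, in percent.
    This is a hypothetical helper, not part of Hadoop.
    """
    total_used = sum(used for used, _ in nodes.values())
    total_capacity = sum(cap for _, cap in nodes.values())
    cluster_util = 100.0 * total_used / total_capacity

    status = {}
    for name, (used, cap) in nodes.items():
        node_util = 100.0 * used / cap
        if node_util > cluster_util + threshold_pct:
            status[name] = "over-utilized"
        elif node_util < cluster_util - threshold_pct:
            status[name] = "under-utilized"
        else:
            status[name] = "balanced"
    return cluster_util, status


if __name__ == "__main__":
    # Made-up example: a small, nearly full disk and a large, mostly empty one.
    example = {"small": (90, 100), "large": (100, 400)}
    util, status = classify_nodes(example)
    print(util, status)
```

With these made-up numbers the small node counts as over-utilized and the large one as under-utilized, so a balancer run would try to move blocks from the small disk to the large one.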

-- 
Sorry for my english!! 明

2008/9/9 [EMAIL PROTECTED] <[EMAIL PROTECTED]>

> Does Hadoop distribute blocks according to how many blocks a node currently
> contains or according to how much disk space the node has remaining
> currently ?
> Suppose that I have many machines with identical CPUs but different disk
> sizes. If the blocks get distributed according to the remaining disk space,
> then the larger disk nodes would be storing more data... would this cause
> performance problems during the mapping phase ?
> Thanks,
> moonwatcher
