Re: Questions on How the Namenode Assign Blocks to Datanodes

Steve Loughran Fri, 24 Jul 2009 07:40:56 -0700

Boyu Zhang wrote:

Dear Steve,


Thank you for your reply. I was worried that my email had gotten lost, but I will
wait longer for an answer next time. Thank you for reminding me : )

I understand that if you have data replica = 3, the namenode will assign the
blocks that way. However, I still have a question: if the data replica = 1
(I just use it for testing, to see how HDFS works), what is the policy for
deciding which datanode gets which block? Thank you so much!

If you are running your code on a datanode, it will be on the machine you are running on (to save bandwidth). Otherwise, another machine will somehow be picked (I forget where and how). Hadoop tries to keep the data balanced across machines, to stop one having all the data, others having less. I don't know whether it goes on percentage of disk space free or total amount of data. You'd have to rummage in the source to work out.
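
As a rough illustration of the rule above for replica = 1, here is a minimal sketch in Python. It is not Hadoop's code: the DataNode record, the choose_target function and the sample hosts are all made up for the example. The local-write rule comes from the description above; the fallback (prefer the node with the largest fraction of free space) is purely an assumption, since which balancing measure Hadoop actually uses is exactly the open question here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DataNode:
    # Hypothetical record for illustration; not a Hadoop class.
    host: str
    capacity_bytes: int
    used_bytes: int

    @property
    def free_fraction(self) -> float:
        # Share of this node's disk that is still free.
        return (self.capacity_bytes - self.used_bytes) / self.capacity_bytes


def choose_target(writer_host: str, datanodes: List[DataNode]) -> DataNode:
    """Pick the datanode that receives a single-replica block."""
    if not datanodes:
        raise ValueError("no datanodes available")
    # Rule from the reply: a writer running on a datanode keeps the block local.
    for node in datanodes:
        if node.host == writer_host:
            return node
    # ASSUMPTION: otherwise prefer the node with the most free space by
    # fraction. The real policy may use a different measure entirely.
    return max(datanodes, key=lambda n: n.free_fraction)


nodes = [
    DataNode("dn1", 1000, 900),
    DataNode("dn2", 1000, 200),
    DataNode("dn3", 2000, 1000),
]
print(choose_target("dn1", nodes).host)     # writer runs on a datanode
print(choose_target("client", nodes).host)  # writer is outside the cluster
```

With these made-up numbers the first call stays on dn1 even though it is nearly full, and the second goes to dn2. Swapping the key in the last line of choose_target for total bytes used would model the other balancing measure mentioned above.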

Like I said, there's been discussion on improving the layout algorithms, to support plugins with different policies.
