Re: rack-awareness for hdfs

Doug Cutting Mon, 17 Sep 2007 20:45:13 -0700

Jeff Hammerbacher wrote:

has anyone leveraged the ability of datanodes to specify which datacenter
and rack they live in?  if so, any evidence of performance improvements?  it
seems that rack-awareness is only leveraged in block replication, not in
task execution.

It often doesn't make a big improvement for map input, since in thecommon configuration, map tasks can nearly always be scheduled on nodeswhere the data is local. However, if you have a large HDFS cluster andoverlay smaller mapreduce clusters over subsets of the hosts, thenrack-locality can help map input performance too.


Doug

Re: rack-awareness for hdfs

Reply via email to