I think I have found a bug, but I am not 100% sure. It seems that somewhere
in the code, probably in the Job/TaskTracker, Hadoop always tries to
resolve an IP address from the host name. If so, this is a bug, because the
IP addresses are already known at that stage (the master knows the IP
address of every slave, and every slave knows the master's address from
the configuration file).
In fact, my servers have multiple IP addresses each, and no DNS is set up,
because DNS is not necessary for a rack of machines used for internal
computing purposes.
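
To make the failure mode I suspect concrete, here is a small stand-alone
Java sketch (this is not Hadoop code, and the host name is made up): any
code path that looks a peer up by host name depends on DNS or /etc/hosts,
even when the IP address is already sitting in the configuration file.

import java.net.InetAddress;
import java.net.UnknownHostException;

// Stand-alone illustration, not Hadoop code. On a rack without DNS, any
// lookup that goes through the host name fails, even though the IP address
// is already known from the configuration file.
public class ResolveCheck {
    public static void main(String[] args) {
        String host = (args.length > 0) ? args[0] : "slave-03"; // made-up slave name
        try {
            InetAddress addr = InetAddress.getByName(host);
            System.out.println(host + " -> " + addr.getHostAddress());
        } catch (UnknownHostException e) {
            // This is what happens on our machines: no DNS entry, so the
            // lookup fails even though the daemon has the node's IP address
            // in its configuration.
            System.out.println("cannot resolve " + host);
        }
    }
}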
Even though I have explicitly set the IP address that each master/slave
node should bind to, at some stage the JT/TT still seems to try to resolve
an IP address from the host name. This is a possible cause of HADOOP-1374,
which I have suffered from for the last two weeks.
I suspect this because after we disabled all network interfaces except the
one used for Hadoop and started a DNS server to resolve all the host
names, the problem (HADOOP-1374) disappeared.
Several suggestions:
1. Do not resolve IP addresses from host names; the addresses are already known.
2. There are too many places in the configuration file to set IP
addresses, and I am afraid they are not actually used by Hadoop at all.
A single IP address setting per node should be enough.
3. Use IP addresses instead of host names in logs and reports, or at
least print the IP address after the host name (see the sketch after this
list). In general, the error reporting for network problems is not
precise enough.
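
As a rough sketch of what I mean in point 3 (the helper below is mine, not
something that exists in Hadoop): print the IP address next to the host
name whenever a node is mentioned in a log or error message, and fall back
gracefully when the name cannot be resolved.

import java.net.InetAddress;
import java.net.UnknownHostException;

// Sketch of suggestion 3; hostWithAddress() is a made-up helper, not part of Hadoop.
public class HostReport {
    // Format "hostname (a.b.c.d)" for log messages, so network errors can
    // be traced even when DNS is not set up.
    static String hostWithAddress(String host) {
        try {
            return host + " (" + InetAddress.getByName(host).getHostAddress() + ")";
        } catch (UnknownHostException e) {
            return host + " (unresolvable)";
        }
    }

    public static void main(String[] args) {
        // made-up host name, just to show the output format
        System.out.println("Lost connection to " + hostWithAddress("tracker-01"));
    }
}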
I am not very familiar with the Hadoop code yet, so please correct me if I
am wrong.
Thanks
Yunhong