Hello,
I am having a problem...when I configure more than one node on my hadoop cluster, the reduce jobs all stall. Ther logs look EXACTLY like the logs described by Amit Kumar Singh. in his post last month. I have checked that the disk is fine (I formatted the HDFS and deleted all the data on the data nodes to make sure) and this is not a firewall issue or a problem with etc/hosts. It works fine with one node, but adding a second node starts the stalling. I have tried using both .16.2 and .16.4, to no avail..... HELP!
