Bwolen Yang wrote:
Here is the setup: -------------------------- cluster: hadoop 0.12.3
Please try Hadoop 0.13.0. I don't know whether it will address your concerns, but it should be faster and is much closer to what developers are currently working on.
jdk 1.6.0_01 HDFS file replication factor: 3 7 machines total 1 machine for namenode 1 machine for jobtracker 5 other machines for slaves (datanodes / mapreduce)
For such a small cluster you'd probably be better running the jobtracker and namenode on the same node and gain another slave.
Doug
