In my existing 48-node cluster, the engineer who originally designed it (no longer here) did not specify logical racks in the HDFS configuration, instead leaving everything in "default-rack". Now I have 4 physical racks of machines, and I am becoming concerned about failure and near/far replication issues.
Anyone have any ideas what will happen if I tell hadoop about the physical rack layout (i.e. nuke default-rack, and create rack110, rack111, rack112, rack113, etc)? -j