Hadoop's default replication factor is 3. You can change it by setting dfs.replication in hdfs-site.xml; the default values are listed in hdfs-default.xml <http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/hdfs-default.xml>. You should have one master (NameNode) and 2 slaves (DataNodes),
and please follow the tutorial at http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-multi-node-cluster/

On Mon, Dec 9, 2013 at 4:50 PM, navaz <[email protected]> wrote:

> second try...
>
> On Sun, Dec 8, 2013 at 1:14 PM, navaz <[email protected]> wrote:
>
>> Hi All
>>
>> I need one suggestion from this group. I am trying to set up a hadoop
>> cluster in my research lab. I would like to know how many physical VMs are
>> needed for this. My main interest is to measure the shuffle phase network
>> traffic. What is the basic requirement, like namenode and data node?
>>
>> Thanks & Regards
>> *Abdul Navaz*
>> *Graduate Student*
>> *University of Houston*
>> *Houston, TX - 77204-4020*
>
> --
> *Abdul Navaz*
> *Masters in Network Communications*
> *University of Houston*
> *Houston, TX - 77204-4020*
> *Ph - 281-685-0388*
> *[email protected]*
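
Regarding the replication setting mentioned at the top of this reply, here is a minimal sketch of the override, assuming a cluster with exactly 2 DataNodes. It goes in hdfs-site.xml in your Hadoop configuration directory. The value 2 is my assumption for that layout, not something taken from your setup: HDFS puts each replica of a block on a different DataNode, so with only 2 DataNodes a factor of 3 cannot be met and the blocks would be reported as under-replicated.

```xml
<?xml version="1.0"?>
<!-- hdfs-site.xml: site-specific overrides.
     hdfs-default.xml only documents the shipped defaults. -->
<configuration>
  <property>
    <name>dfs.replication</name>
    <!-- Assumption: 2 DataNodes, so at most 2 replicas per block can be placed.
         Leave this property out to keep the default of 3 on larger clusters. -->
    <value>2</value>
  </property>
</configuration>
```

The setting applies to files written after the change; files that already exist keep the replication factor they were created with.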
