You can run multiple data nodes on the same machine. You should create a separate config directory for each dn. The following stuff needs to be created hdfs-site.xml Pid/log/data/tmp dirs log4.properties Master/slaves And then start these data nodes ( something like this: bin/hdfs --config $DIR/dn$i datanode > $DIR/dn$i/log/datanode_log$i.log 2>&1&)
Or you can look at MiniDFSCluster On 9/24/09 11:00 PM, "Huang Qian" <[email protected]> wrote: > Hi guys, > > I just want to simulate a cluster with Hadoop on my laptop, so I chose the > pseudo-distribute mode. The example is running well, but now I just want to > test getting date from different machines. Unfortunately, I have not found > anything on that topic yet. Can Hadoop fit my needs under pseudo-distribute > mode? Thank you all~ > > All the best > Qian
