Hi Huang, Boris's answer should work fine. If it would be useful for you to have a single command line tool to start up a pseudo-distributed cluster for testing, please comment on this JIRA:
http://issues.apache.org/jira/browse/MAPREDUCE-987 -Todd On Fri, Sep 25, 2009 at 10:19 AM, Boris Shkolnik <[email protected]>wrote: > You can run multiple data nodes on the same machine. > You should create a separate config directory for each dn. > The following stuff needs to be created > hdfs-site.xml > Pid/log/data/tmp dirs > log4.properties > Master/slaves > And then start these data nodes > ( something like this: bin/hdfs --config $DIR/dn$i datanode > > $DIR/dn$i/log/datanode_log$i.log 2>&1&) > > > Or you can look at MiniDFSCluster > > > On 9/24/09 11:00 PM, "Huang Qian" <[email protected]> wrote: > > > Hi guys, > > > > I just want to simulate a cluster with Hadoop on my laptop, so I chose > the > > pseudo-distribute mode. The example is running well, but now I just want > to > > test getting date from different machines. Unfortunately, I have not > found > > anything on that topic yet. Can Hadoop fit my needs under > pseudo-distribute > > mode? Thank you all~ > > > > All the best > > Qian > >
