Hi Changguanghui,

as far as I can see, your NameNode configuration is missing. Apache Hama requires HDFS (Hadoop Distributed File System). If you look at the log files, I think you'll notice that the BSPMaster never comes up.
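As a rough sketch, the missing entry in hama-site.xml could look like the following. The host "zgq" is taken from the bsp.master.address in your file; the port 9001 is only an assumption and has to match whatever fs.default.name your Hadoop installation actually uses.

```xml
<!-- Sketch only: "zgq" comes from your bsp.master.address setting.
     Port 9001 is an assumption; use the host and port of your own
     Hadoop NameNode (the fs.default.name in your Hadoop configuration). -->
<property>
  <name>fs.default.name</name>
  <value>hdfs://zgq:9001</value>
</property>
```

The entry goes inside the <configuration> element, next to the bsp.master.address and hama.zookeeper.quorum properties.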
If you have not set up Hadoop yet, you can use the tutorials linked here:
http://wiki.apache.org/hama/GettingStarted#Hadoop_Installation

Have a look at our wiki to see what the hama-site.xml should look like [1]. For your 3-node cluster I would also recommend setting up a single ZooKeeper. You can have a look at my configuration:

<property>
  <name>bsp.master.address</name>
  <value>raynor:40000</value>
</property>

<property>
  <name>fs.default.name</name>
  <value>hdfs://raynor:9001</value>
</property>

<property>
  <name>hama.zookeeper.quorum</name>
  <value>raynor</value>
</property>

As you can see, I run on a host named "raynor" and my Hadoop NameNode is available under "hdfs://raynor:9001".

Once it is configured properly, you can verify it by looking at your logs or at our web interface, which is available at http://localhost:40013. There you can check whether your slaves are listed.

Best regards,
Thomas

[1] http://wiki.apache.org/hama/GettingStarted#Settings

2011/9/15 changguanghui <[email protected]>

> Hi,
>
> I run the “hama-examples-0.3.0-incubating.jar test” on my machine, but it will block.
>
> The hama-site.xml you can see:
>
> <?xml version="1.0"?>
> <?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
> <!--
> /**
>  * Copyright 2007 The Apache Software Foundation
>  *
>  * Licensed to the Apache Software Foundation (ASF) under one
>  * or more contributor license agreements. See the NOTICE file
>  * distributed with this work for additional information
>  * regarding copyright ownership. The ASF licenses this file
>  * to you under the Apache License, Version 2.0 (the
>  * "License"); you may not use this file except in compliance
>  * with the License.
>  * You may obtain a copy of the License at
>  *
>  *     http://www.apache.org/licenses/LICENSE-2.0
>  *
>  * Unless required by applicable law or agreed to in writing, software
>  * distributed under the License is distributed on an "AS IS" BASIS,
>  * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
>  * See the License for the specific language governing permissions and
>  * limitations under the License.
>  */
> -->
> <configuration>
>   <property>
>     <name>bsp.master.address</name>
>     <value>zgq</value>
>     <description>The address of the bsp master server. Either the
>     literal string "local" or a host[:port] (where host is a name or
>     IP address) for distributed mode.
>     </description>
>   </property>
>
>   <property>
>     <name>hama.zookeeper.quorum</name>
>     <value>localhost,zxj,xwh</value>
>     <description>Comma separated list of servers in the ZooKeeper quorum.
>     For example, "host1.mydomain.com,host2.mydomain.com,host3.mydomain.com".
>     By default this is set to localhost for local and pseudo-distributed modes
>     of operation. For a fully-distributed setup, this should be set to a full
>     list of ZooKeeper quorum servers.
>     If HAMA_MANAGES_ZK is set in hama-env.sh
>     this is the list of servers which we will start/stop ZooKeeper on.
>     </description>
>   </property>
>
> </configuration>
>
> ------------------------------------------------------------------------
>
> The following is the result when I run it, the supersteps number seems to be 0 forever:
>
> zgq hama-0.3.0-incubating/bin# ./start-bspd.sh
> zxj: starting zookeeper, logging to /home/cgh/hama-0.3.0-incubating/bin/../logs/hama-root-zookeeper-zxj.out
> xwh: starting zookeeper, logging to /home/cgh/hama-0.3.0-incubating/bin/../logs/hama-root-zookeeper-xwh.out
> localhost: starting zookeeper, logging to /home/cgh/hama-0.3.0-incubating/bin/../logs/hama-root-zookeeper-zgq..out
> starting bspmaster, logging to /home/cgh/hama-0.3.0-incubating/bin/../logs/hama-root-bspmaster-zgq..out
> 2011-09-15 16:17:12.668:INFO::Logging to STDERR via org.mortbay.log.StdErrLog
> 2011-09-15 16:17:12.712:INFO::jetty-0.3.0-incubating
> 2011-09-15 16:17:12.844:INFO::Started SelectChannelConnector@zgq:40013
> zxj: starting groom, logging to /home/cgh/hama-0.3.0-incubating/bin/../logs/hama-root-groom-zxj.out
> localhost: starting groom, logging to /home/cgh/hama-0.3.0-incubating/bin/../logs/hama-root-groom-zgq..out
> xwh: starting groom, logging to /home/cgh/hama-0.3.0-incubating/bin/../logs/hama-root-groom-xwh.out
> zgq hama-0.3.0-incubating/bin# ./hama jar ../hama-examples-0.3.0-incubating.jar test
> 11/09/15 16:17:31 INFO bsp.BSPJobClient: Running job: job_201109151617_0001
> 11/09/15 16:17:34 INFO bsp.BSPJobClient: Current supersteps number: 0
>
> Thank you very much!
>
> Changguanghui
>
> From: Thomas Jungblut [mailto:[email protected]]
> Sent: September 15, 2011, 16:10
> To: [email protected]
> Cc: changguanghui
> Subject: Re: Hama help (how the distributed mode is working)
>
> Hi,
>
> there are several examples in the HAMA tar ball. Which version did you use?
> Which example did you use?
> How is your cluster configured?
> Please post your hama-site.xml of all hosts involved in your cluster.
>
> Thanks and best regards,
> Thomas
>
> 2011/9/15 changguanghui <[email protected]>
>
> Hi,
> I can't test the example which provide in HAMA tar ball on three machine.
> The trouble is how can I config the distributed HAMA ?
> Could you tell me some details for setup the HAMA on three machine. Thank you!
>
> -----Original Message-----
> From: Thomas Jungblut [mailto:[email protected]]
> Sent: September 14, 2011, 18:00
> To: Luis Eduardo Pineda Morales
> Cc: [email protected]
> Subject: Re: Hama help (Local mode not working)
>
> Hi Luis,
>
> > - For mere consistency of the page, you might want to use the tag <tt>
> > (used in the rest of the document) instead of the <em> that you are using
> > for names of files and configuration properties.
>
> Thanks, I will take care of that.
>
> > - I don't know if this is only my problem, but when I execute Hama with the
> > Local configuration, the Master doesn't run (and neither does the Groom).
> > They don't recognize "local" as a valid hostname, both fail with this
> > exception:
>
> "local" itself is no hostname, there is a bug in our handling of this mode.
> Actually nothing should be launched then. I'll extend this in our wiki.
> What you are searching for is the pseudo-distributed mode which runs a
> Master, Groom and Zookeeper on your machine.
> You then have to provide "localhost" as the hostname or the real hostname of
> your machine.
>
> > Is this maybe a problem with version 0.3? Would you suggest me to use 0.2
> > instead?
>
> In 0.2 is no local mode, so you won't face these problems.
> Since this is a twiddle in your configuration, which should be solved using
> "localhost" instead of "local", you don't need to downgrade.
>
> I hope it will help you.
>
> Regards,
> Thomas
>
> 2011/9/14 Luis Eduardo Pineda Morales <[email protected]>
>
> > Thanks for you prompt reply Thomas,
> >
> > The wiki is more clarifying now that you added the part of the Modes.
> > However, if I may, I have a couple of remarks to mention:
> >
> > - For mere consistency of the page, you might want to use the tag <tt>
> > (used in the rest of the document) instead of the <em> that you are using
> > for names of files and configuration properties.
> >
> > - I don't know if this is only my problem, but when I execute Hama with the
> > Local configuration, the Master doesn't run (and neither does the Groom).
> > They don't recognize "local" as a valid hostname, both fail with this
> > exception:
> >
> > From bspmaster.log:
> >
> > FATAL org.apache.hama.BSPMasterRunner: java.net.UnknownHostException: Invalid hostname for server: local
> >     at org.apache.hadoop.ipc.Server.bind(Server.java:198)
> >     at org.apache.hadoop.ipc.Server$Listener.<init>(Server.java:253)
> >     at org.apache.hadoop.ipc.Server.<init>(Server.java:1026)
> >     at org.apache.hadoop.ipc.RPC$Server.<init>(RPC.java:488)
> >     at org.apache.hadoop.ipc.RPC.getServer(RPC.java:450)
> >     at org.apache.hadoop.ipc.RPC.getServer(RPC.java:441)
> >     at org.apache.hama.bsp.BSPMaster.<init>(BSPMaster.java:250)
> >     at org.apache.hama.bsp.BSPMaster.startMaster(BSPMaster.java:421)
> >     at org.apache.hama.bsp.BSPMaster.startMaster(BSPMaster.java:415)
> >     at org.apache.hama.BSPMasterRunner.run(BSPMasterRunner.java:46)
> >     at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
> >     at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
> >     at org.apache.hama.BSPMasterRunner.main(BSPMasterRunner.java:56)
> >
> > From groom.log
> >
> > ERROR org.apache.hama.bsp.GroomServer: Got fatal exception while reinitializing GroomServer: java.net.UnknownHostException: unknown host: local
> >     at org.apache.hadoop.ipc.Client$Connection.<init>(Client.java:195)
> >     at org.apache.hadoop.ipc.Client.getConnection(Client.java:850)
> >     at org.apache.hadoop.ipc.Client.call(Client.java:720)
> >     at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220)
> >     at $Proxy4.getProtocolVersion(Unknown Source)
> >     at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:359)
> >     at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:346)
> >     at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:383)
> >     at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:314)
> >     at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:291)
> >     at org.apache.hama.bsp.GroomServer.initialize(GroomServer.java:279)
> >     at org.apache.hama.bsp.GroomServer.run(GroomServer.java:600)
> >     at java.lang.Thread.run(Thread.java:680)
> >
> > I've tested it in Debian, Ubuntu and MacOS Terminal. Is this maybe a
> > problem with version 0.3? Would you suggest me to use 0.2 instead?
> >
> > I'm copying this to the user mailing list too, hope you don't mind.
> >
> > Luis

--
Thomas Jungblut
Berlin

mobile: 0170-3081070

business: [email protected]
private: [email protected]
