Hi all,
I have a problem starting a cluster of 1 master and 3 region servers. When I
start the cluster, the HMaster and HRegionServer processes exit automatically.
My hbase-site.xml:
<configuration>
<property>
<name>hbase.master</name>
<value>hadoopmaster:60000</value>
</property>
<property>
<name>hbase.rootdir</name>
<value>hdfs://hadoopmaster:9000/hbase</value>
</property>
<property>
<name>hbase.zookeeper.property.dataDir</name>
<value>/proj/ucare/bo/hadoop_data/zookeeper</value>
</property>
<property>
<name>hbase.zookeeper.quorum</name>
<value>pc225.emulab.net,pc273.emulab.net,pc210.emulab.net</value>
<description>The directory shared by RegionServers.
</description>
</property>
<property>
<name>hbase.zookeeper.property.clientPort</name>
<value>2181</value>
</property>
<property>
<name>hbase.cluster.distributed</name>
<value>true</value>
</property>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>
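In case it is useful for checking the setup, here is a minimal Python sketch of how the ZooKeeper endpoints can be read out of a file like the one above, so they can be compared with the hosts that appear in the log. The helper name `quorum_endpoints` and the trimmed `SAMPLE` document are only illustrative; the fallback port 2181 assumes the HBase default client port.

```python
import xml.etree.ElementTree as ET
from typing import List


def quorum_endpoints(site_xml: str) -> List[str]:
    """Return host:port strings for the ZooKeeper quorum in an hbase-site.xml document."""
    root = ET.fromstring(site_xml)
    # Collect every <property> as a name -> value pair.
    props = {}
    for prop in root.findall("property"):
        name = prop.findtext("name", "").strip()
        value = prop.findtext("value", "").strip()
        props[name] = value
    # Assumption: fall back to 2181 when no client port is configured.
    port = props.get("hbase.zookeeper.property.clientPort", "2181")
    quorum = props.get("hbase.zookeeper.quorum", "")
    hosts = [h.strip() for h in quorum.split(",") if h.strip()]
    return [f"{host}:{port}" for host in hosts]


# Illustrative sample: only the two ZooKeeper properties from the file above.
SAMPLE = """<configuration>
<property>
<name>hbase.zookeeper.quorum</name>
<value>pc225.emulab.net,pc273.emulab.net,pc210.emulab.net</value>
</property>
<property>
<name>hbase.zookeeper.property.clientPort</name>
<value>2181</value>
</property>
</configuration>"""

if __name__ == "__main__":
    for endpoint in quorum_endpoints(SAMPLE):
        print(endpoint)
```

The printed list can then be compared by hand with the `quorum=` value that HBase reports in its own log.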
Log file:
2015-04-26 11:38:45,938 INFO [main-SendThread(pc273.emulab.net:2181)] zookeeper.ClientCnxn: Opening socket connection to server pc273.emulab.net/155.98.39.73:2181. Will not attempt to authenticate using SASL (unknown error)
2015-04-26 11:38:45,939 INFO [main-SendThread(pc273.emulab.net:2181)] zookeeper.ClientCnxn: Socket connection established to pc273.emulab.net/155.98.39.73:2181, initiating session
2015-04-26 11:38:45,940 INFO [main-SendThread(pc273.emulab.net:2181)] zookeeper.ClientCnxn: Unable to read additional data from server sessionid 0x0, likely server has closed socket, closing socket connection and attempting reconnect
2015-04-26 11:38:47,022 INFO [main-SendThread(pc225.emulab.net:2181)] zookeeper.ClientCnxn: Opening socket connection to server pc225.emulab.net/155.98.39.25:2181. Will not attempt to authenticate using SASL (unknown error)
2015-04-26 11:38:47,023 INFO [main-SendThread(pc225.emulab.net:2181)] zookeeper.ClientCnxn: Socket connection established to pc225.emulab.net/155.98.39.25:2181, initiating session
2015-04-26 11:38:47,025 INFO [main-SendThread(pc225.emulab.net:2181)] zookeeper.ClientCnxn: Unable to read additional data from server sessionid 0x0, likely server has closed socket, closing socket connection and attempting reconnect
2015-04-26 11:38:47,994 INFO [main-SendThread(pc332.emulab.net:2181)] zookeeper.ClientCnxn: Opening socket connection to server pc332.emulab.net/155.98.39.132:2181. Will not attempt to authenticate using SASL (unknown error)
2015-04-26 11:39:17,150 INFO [main-SendThread(pc332.emulab.net:2181)] zookeeper.ClientCnxn: Client session timed out, have not heard from server in 30025ms for sessionid 0x0, closing socket connection and attempting reconnect
2015-04-26 11:39:17,251 WARN [main] zookeeper.RecoverableZooKeeper: Possibly transient ZooKeeper, quorum=pc273.emulab.net:2181,pc225.emulab.net:2181,pc332.emulab.net:2181, exception=org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /hbase
2015-04-26 11:39:17,251 ERROR [main] zookeeper.RecoverableZooKeeper: ZooKeeper create failed after 4 attempts
2015-04-26 11:39:17,255 ERROR [main] master.HMasterCommandLine: Master exiting
java.lang.RuntimeException: Failed construction of Master: class org.apache.hadoop.hbase.master.HMaster
    at org.apache.hadoop.hbase.master.HMaster.constructMaster(HMaster.java:1982)
    at org.apache.hadoop.hbase.master.HMasterCommandLine.startMaster(HMasterCommandLine.java:198)
    at org.apache.hadoop.hbase.master.HMasterCommandLine.run(HMasterCommandLine.java:139)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
    at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:126)
    at org.apache.hadoop.hbase.master.HMaster.main(HMaster.java:1996)
Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /hbase
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
    at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:783)
    at org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.createNonSequential(RecoverableZooKeeper.java:512)
    at org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.create(RecoverableZooKeeper.java:491)
    at org.apache.hadoop.hbase.zookeeper.ZKUtil.createWithParents(ZKUtil.java:1252)
    at org.apache.hadoop.hbase.zookeeper.ZKUtil.createWithParents(ZKUtil.java:1230)
    at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.createBaseZNodes(ZooKeeperWatcher.java:174)
    at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.<init>(ZooKeeperWatcher.java:167)
    at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:531)
    at org.apache.hadoop.hbase.master.HMaster.<init>(HMaster.java:333)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
    at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
Can someone help me figure out the problem? Is it because the numbers of
region servers and ZooKeeper nodes are incorrect?
Thanks a lot!
Bo