Hi,

Have you checked whether your ZK quorum is running properly before trying to start HBase? Also, the hostnames defined for the ZK quorum nodes seem quite unusual. Shouldn't they be as follows?
<property>
  <name>hbase.zookeeper.quorum</name>
  <value>pc225.emulab.net,pc273.emulab.net,pc210.emulab.net</value>
  <description>The directory shared by RegionServers.</description>
</property>

Regards,
Wellington.

On 26 Apr 2015, at 19:08, Bo Fu <[email protected]> wrote:

> Hi all,
>
> I have a problem starting a cluster of 1 master and 3 region servers. When I
> started the cluster, the HMaster and HRegionServer will automatically exit.
>
> My hbase-site.xml:
> <configuration>
>   <property>
>     <name>hbase.master</name>
>     <value>hadoopmaster:60000</value>
>   </property>
>   <property>
>     <name>hbase.rootdir</name>
>     <value>hdfs://hadoopmaster:9000/hbase</value>
>   </property>
>   <property>
>     <name>hbase.zookeeper.property.dataDir</name>
>     <value>/proj/ucare/bo/hadoop_data/zookeeper</value>
>   </property>
>   <property>
>     <name>hbase.zookeeper.quorum</name>
>     <value>pc225.emulab.net<http://pc225.emulab.net>,pc273.emulab.net<http://pc273.emulab.net>,pc210.emulab.net<http://pc210.emulab.net></value>
>     <description>The directory shared by RegionServers.</description>
>   </property>
>   <property>
>     <name>hbase.zookeeper.property.clientPort</name>
>     <value>2181</value>
>   </property>
>   <property>
>     <name>hbase.cluster.distributed</name>
>     <value>true</value>
>   </property>
>   <property>
>     <name>dfs.replication</name>
>     <value>1</value>
>   </property>
> </configuration>
>
> Log file:
>
> 2015-04-26 11:38:45,938 INFO [main-SendThread(pc273.emulab.net:2181)] zookeeper.ClientCnxn: Opening socket connection to server pc273.emulab.net/155.98.39.73:2181. Will not attempt to authenticate using SASL (unknown error)
> 2015-04-26 11:38:45,939 INFO [main-SendThread(pc273.emulab.net:2181)] zookeeper.ClientCnxn: Socket connection established to pc273.emulab.net/155.98.39.73:2181, initiating session
> 2015-04-26 11:38:45,940 INFO [main-SendThread(pc273.emulab.net:2181)] zookeeper.ClientCnxn: Unable to read additional data from server sessionid 0x0, likely server has closed socket, closing socket connection and attempting reconnect
> 2015-04-26 11:38:47,022 INFO [main-SendThread(pc225.emulab.net:2181)] zookeeper.ClientCnxn: Opening socket connection to server pc225.emulab.net/155.98.39.25:2181. Will not attempt to authenticate using SASL (unknown error)
> 2015-04-26 11:38:47,023 INFO [main-SendThread(pc225.emulab.net:2181)] zookeeper.ClientCnxn: Socket connection established to pc225.emulab.net/155.98.39.25:2181, initiating session
> 2015-04-26 11:38:47,025 INFO [main-SendThread(pc225.emulab.net:2181)] zookeeper.ClientCnxn: Unable to read additional data from server sessionid 0x0, likely server has closed socket, closing socket connection and attempting reconnect
> 2015-04-26 11:38:47,994 INFO [main-SendThread(pc332.emulab.net:2181)] zookeeper.ClientCnxn: Opening socket connection to server pc332.emulab.net/155.98.39.132:2181. Will not attempt to authenticate using SASL (unknown error)
> 2015-04-26 11:39:17,150 INFO [main-SendThread(pc332.emulab.net:2181)] zookeeper.ClientCnxn: Client session timed out, have not heard from server in 30025ms for sessionid 0x0, closing socket connection and attempting reconnect
> 2015-04-26 11:39:17,251 WARN [main] zookeeper.RecoverableZooKeeper: Possibly transient ZooKeeper, quorum=pc273.emulab.net:2181,pc225.emulab.net:2181,pc332.emulab.net:2181, exception=org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /hbase
> 2015-04-26 11:39:17,251 ERROR [main] zookeeper.RecoverableZooKeeper: ZooKeeper create failed after 4 attempts
> 2015-04-26 11:39:17,255 ERROR [main] master.HMasterCommandLine: Master exiting
> java.lang.RuntimeException: Failed construction of Master: class org.apache.hadoop.hbase.master.HMaster
>     at org.apache.hadoop.hbase.master.HMaster.constructMaster(HMaster.java:1982)
>     at org.apache.hadoop.hbase.master.HMasterCommandLine.startMaster(HMasterCommandLine.java:198)
>     at org.apache.hadoop.hbase.master.HMasterCommandLine.run(HMasterCommandLine.java:139)
>     at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
>     at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:126)
>     at org.apache.hadoop.hbase.master.HMaster.main(HMaster.java:1996)
> Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /hbase
>     at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
>     at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
>     at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:783)
>     at org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.createNonSequential(RecoverableZooKeeper.java:512)
>     at org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.create(RecoverableZooKeeper.java:491)
>     at org.apache.hadoop.hbase.zookeeper.ZKUtil.createWithParents(ZKUtil.java:1252)
>     at org.apache.hadoop.hbase.zookeeper.ZKUtil.createWithParents(ZKUtil.java:1230)
>     at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.createBaseZNodes(ZooKeeperWatcher.java:174)
>     at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.<init>(ZooKeeperWatcher.java:167)
>     at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:531)
>     at org.apache.hadoop.hbase.master.HMaster.<init>(HMaster.java:333)
>     at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>     at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
>     at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
>
> Can someone help me figure out the problem? Is it because the number of
> region servers and ZooKeeper nodes is incorrect?
>
> Thanks a lot!
>
> Bo
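
One way to check whether each ZK node in the quorum is actually answering before starting HBase is to send it ZooKeeper's four-letter "ruok" command; a healthy server replies "imok". Below is a minimal Python sketch of that check. The helper names (parse_quorum, zk_is_ok, check_quorum) are made up for illustration, and it assumes four-letter commands are enabled on the servers (newer ZooKeeper releases may require whitelisting them via 4lw.commands.whitelist).

```python
import socket


def parse_quorum(value, default_port=2181):
    """Split an hbase.zookeeper.quorum value into (host, port) pairs."""
    nodes = []
    for entry in value.split(","):
        entry = entry.strip()
        if not entry:
            continue
        host, _, port = entry.partition(":")
        nodes.append((host, int(port) if port else default_port))
    return nodes


def zk_is_ok(host, port, timeout=5.0):
    """Send 'ruok' to one ZooKeeper server and report whether it says 'imok'."""
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            sock.sendall(b"ruok")
            reply = b""
            # The reply is exactly four bytes; stop early if the server closes.
            while len(reply) < 4:
                chunk = sock.recv(4 - len(reply))
                if not chunk:
                    break
                reply += chunk
        return reply == b"imok"
    except OSError:
        # Covers DNS failures, refused connections and timeouts.
        return False


def check_quorum(value, default_port=2181, timeout=5.0):
    """Return a {"host:port": bool} map for every node in the quorum string."""
    return {
        f"{host}:{port}": zk_is_ok(host, port, timeout)
        for host, port in parse_quorum(value, default_port)
    }


# Example (hostnames taken from the hbase-site.xml in this thread):
# print(check_quorum("pc225.emulab.net,pc273.emulab.net,pc210.emulab.net"))
```

Any node that maps to False is either unreachable or not serving requests, which would be consistent with the ConnectionLoss errors in the log. Note that the log lists pc332.emulab.net in the quorum rather than pc210.emulab.net, so it may be worth probing whichever hosts the running configuration actually uses.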
