Have them use the same configuration? J-D
On Wed, Aug 3, 2011 at 7:17 AM, shanmuganathan.r <[email protected]> wrote: > Hi Jean, > > The Problem is continued even the Active master is properly > killed. How can we make the Active master and backup master use the same > zookeeper ensemble. > > Regards, > > Shanmuganathan > > > > ---- On Tue, 02 Aug 2011 22:42:10 +0530 Jean-Daniel > Cryans<[email protected]> wrote ---- > > > Like it says, the backup master is waiting for the primary one to > create the znode as in it considers it's not even running yet. If your > primary node is indeed running correctly, it could mean that they > aren't using the same Zookeeper ensemble or root znode. > > Check that first. > > J-D > > On Tue, Aug 2, 2011 at 4:12 AM, shanmuganathan.r > <[email protected]> wrote: > > Hi All, > > > > I have the problem in my HBase fully distributed mode with backup > master . The HBase in the fully distributed mode is starting and working > properly. The Active Master is working and My backup master is waiting for > the active master termination. If I kill the active master process, the > backup master don't come to active which remains in the waiting state. > > > > > > I declared the backup master in the HBase/conf/Backup-masters file. > > > > > > My backup master log is > > 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: > MetricsString added: revision > > 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: > MetricsString added: hdfsUser > > 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: > MetricsString added: hdfsDate > > 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: > MetricsString added: hdfsUrl > > 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: > MetricsString added: date > > 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: > MetricsString added: hdfsRevision > > 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: > MetricsString added: user > > 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: > MetricsString added: hdfsVersion > > 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: > MetricsString added: url > > 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: > MetricsString added: version > > 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: new > MBeanInfo > > 2011-08-02 16:04:16,417 INFO org.apache.hadoop.hbase.metrics: new > MBeanInfo > > 2011-08-02 16:04:16,417 INFO > org.apache.hadoop.hbase.master.metrics.MasterMetrics: Initialized > > 2011-08-02 16:04:16,418 DEBUG org.apache.hadoop.hbase.master.HMaster: > HMaster started in backup mode. Stalling until master znode is written. > > 2011-08-02 16:04:16,418 DEBUG org.apache.hadoop.hbase.master.HMaster: > Waiting for master address ZNode to be written (Also watching cluster state > node) > > 2011-08-02 16:07:16,418 DEBUG org.apache.hadoop.hbase.master.HMaster: > Waiting for master address ZNode to be written (Also watching cluster state > node) > > 2011-08-02 16:10:16,418 DEBUG org.apache.hadoop.hbase.master.HMaster: > Waiting for master address ZNode to be written (Also watching cluster state > node) > > 2011-08-02 16:13:16,418 DEBUG org.apache.hadoop.hbase.master.HMaster: > Waiting for master address ZNode to be written (Also watching cluster state > node) > > > > > > > -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- > > > > > > And the one of my region server log is > > > > > > 2011-08-02 16:21:45,977 DEBUG > org.apache.hadoop.hbase.io.hfile.LruBlockCache: LRU Stats: total=958.41 KB, > free=198.43 MB, max=199.36 MB, blocks=1, accesses=10, hits=9, > hitRatio=89.99%%, cachingAccesses=10, cachingHits=9, > cachingHitsRatio=89.99%%, evictions=0, evicted=0, evictedPerRun=NaN > > 2011-08-02 16:22:40,714 WARN > org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to connect to > master. Retrying. Error was: > > java.net.ConnectException: Connection refused > > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:592) > > at > org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206) > > at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:404) > > at > org.apache.hadoop.hbase.ipc.HBaseClient$Connection.setupIOstreams(HBaseClient.java:328) > > at > org.apache.hadoop.hbase.ipc.HBaseClient.getConnection(HBaseClient.java:883) > > at > org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:750) > > at > org.apache.hadoop.hbase.ipc.HBaseRPC$Invoker.invoke(HBaseRPC.java:257) > > at $Proxy5.getProtocolVersion(Unknown Source) > > at > org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:419) > > at > org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:393) > > at > org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:444) > > at > org.apache.hadoop.hbase.ipc.HBaseRPC.waitForProxy(HBaseRPC.java:349) > > at > org.apache.hadoop.hbase.regionserver.HRegionServer.getMaster(HRegionServer.java:1445) > > at > org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:737) > > at > org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:586) > > at java.lang.Thread.run(Thread.java:636) > > > > > > > > > -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- > > > > > > 1. How can we solve this problem? > > > > > > 2. Is there any extra configuration needed for this backup master > configuration in HBase? > > > > > > Thanks for your > valuable suggestions ......! > > > > > > Regards, > > > > Shanmuganathan > > > > > > > > > > >
