Hi Jean,
The problem persists even when the active master is properly killed.
How can we make the active master and the backup master use the same ZooKeeper
ensemble?
Regards,
Shanmuganathan
---- On Tue, 02 Aug 2011 22:42:10 +0530 Jean-Daniel
Cryans<[email protected]> wrote ----
Like it says, the backup master is waiting for the primary one to
create the znode; in other words, it considers that the primary isn't
even running yet. If your primary master is indeed running correctly, it
could mean that the two aren't using the same ZooKeeper ensemble or root
znode.
Check that first.
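One way to check is to compare the ZooKeeper-related properties in the
hbase-site.xml of the active master host and the backup master host. The
snippet below is only a rough sketch of that comparison, not an official
HBase tool. It assumes the relevant settings are hbase.zookeeper.quorum,
hbase.zookeeper.property.clientPort and zookeeper.znode.parent, and that
both files are set explicitly; the helper names read_zk_settings and
diff_zk_settings are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Properties assumed to decide which ZooKeeper ensemble and root znode
# an HBase process talks to.
ZK_KEYS = (
    "hbase.zookeeper.quorum",
    "hbase.zookeeper.property.clientPort",
    "zookeeper.znode.parent",
)


def read_zk_settings(xml_text):
    """Return the ZooKeeper-related properties found in hbase-site.xml text."""
    root = ET.fromstring(xml_text)
    settings = {}
    for prop in root.iter("property"):
        name = (prop.findtext("name") or "").strip()
        if name in ZK_KEYS:
            settings[name] = (prop.findtext("value") or "").strip()
    return settings


def diff_zk_settings(active_xml, backup_xml):
    """Return {key: (active_value, backup_value)} for every key that differs.

    A value of None means the property is not set in that file, in which
    case HBase falls back to its built-in default.
    """
    active = read_zk_settings(active_xml)
    backup = read_zk_settings(backup_xml)
    return {
        key: (active.get(key), backup.get(key))
        for key in ZK_KEYS
        if active.get(key) != backup.get(key)
    }
```

Read the contents of each host's hbase-site.xml into a string and pass
both to diff_zk_settings; an empty result means the three properties
match. The comparison is a plain string match, so the same quorum hosts
listed in a different order would still be reported as a difference.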
J-D
On Tue, Aug 2, 2011 at 4:12 AM, shanmuganathan.r
<[email protected]> wrote:
> Hi All,
>
> I have a problem with my HBase cluster in fully distributed mode with a
> backup master. HBase starts and works properly in fully distributed mode.
> The active master is working, and my backup master is waiting for the
> active master to terminate. If I kill the active master process, the backup
> master does not become active; it remains in the waiting state.
>
>
> I declared the backup master in the HBase/conf/Backup-masters file.
>
>
> My backup master log is
> 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: MetricsString added: revision
> 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: MetricsString added: hdfsUser
> 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: MetricsString added: hdfsDate
> 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: MetricsString added: hdfsUrl
> 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: MetricsString added: date
> 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: MetricsString added: hdfsRevision
> 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: MetricsString added: user
> 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: MetricsString added: hdfsVersion
> 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: MetricsString added: url
> 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: MetricsString added: version
> 2011-08-02 16:04:16,416 INFO org.apache.hadoop.hbase.metrics: new MBeanInfo
> 2011-08-02 16:04:16,417 INFO org.apache.hadoop.hbase.metrics: new MBeanInfo
> 2011-08-02 16:04:16,417 INFO org.apache.hadoop.hbase.master.metrics.MasterMetrics: Initialized
> 2011-08-02 16:04:16,418 DEBUG org.apache.hadoop.hbase.master.HMaster: HMaster started in backup mode. Stalling until master znode is written.
> 2011-08-02 16:04:16,418 DEBUG org.apache.hadoop.hbase.master.HMaster: Waiting for master address ZNode to be written (Also watching cluster state node)
> 2011-08-02 16:07:16,418 DEBUG org.apache.hadoop.hbase.master.HMaster: Waiting for master address ZNode to be written (Also watching cluster state node)
> 2011-08-02 16:10:16,418 DEBUG org.apache.hadoop.hbase.master.HMaster: Waiting for master address ZNode to be written (Also watching cluster state node)
> 2011-08-02 16:13:16,418 DEBUG org.apache.hadoop.hbase.master.HMaster: Waiting for master address ZNode to be written (Also watching cluster state node)
>
>
>
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>
> And the log from one of my region servers is
>
>
> 2011-08-02 16:21:45,977 DEBUG org.apache.hadoop.hbase.io.hfile.LruBlockCache: LRU Stats: total=958.41 KB, free=198.43 MB, max=199.36 MB, blocks=1, accesses=10, hits=9, hitRatio=89.99%%, cachingAccesses=10, cachingHits=9, cachingHitsRatio=89.99%%, evictions=0, evicted=0, evictedPerRun=NaN
> 2011-08-02 16:22:40,714 WARN org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to connect to master. Retrying. Error was:
> java.net.ConnectException: Connection refused
> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
> at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:592)
> at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206)
> at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:404)
> at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.setupIOstreams(HBaseClient.java:328)
> at org.apache.hadoop.hbase.ipc.HBaseClient.getConnection(HBaseClient.java:883)
> at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:750)
> at org.apache.hadoop.hbase.ipc.HBaseRPC$Invoker.invoke(HBaseRPC.java:257)
> at $Proxy5.getProtocolVersion(Unknown Source)
> at org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:419)
> at org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:393)
> at org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:444)
> at org.apache.hadoop.hbase.ipc.HBaseRPC.waitForProxy(HBaseRPC.java:349)
> at org.apache.hadoop.hbase.regionserver.HRegionServer.getMaster(HRegionServer.java:1445)
> at org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:737)
> at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:586)
> at java.lang.Thread.run(Thread.java:636)
>
>
>
>
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>
> 1. How can we solve this problem?
>
>
> 2. Is any extra configuration needed to set up a backup master in HBase?
>
>
> Thanks for your valuable suggestions!
>
>
> Regards,
>
> Shanmuganathan
>
>
>
>