Ashu Pachauri created HBASE-14699:
-------------------------------------
Summary: Replication crashes regionservers when hbase.wal.provider is set to multiwal
Key: HBASE-14699
URL: https://issues.apache.org/jira/browse/HBASE-14699
Project: HBase
Issue Type: Bug
Components: Replication
Reporter: Ashu Pachauri
Priority: Blocker
When hbase.wal.provider is set to multiwal and replication is enabled,
regionservers crash with the following exception:
{code}
<hostname>,16020,1445495411258: Failed to write replication wal position (filename=<hostname>%2C16020%2C1445495411258.null0.1445495898373, position=1322399)
org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /hbase/replication/rs/<hostname>,16020,1445495411258/1/<hostname>%2C16020%2C1445495411258.null0.1445495898373
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:111)
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
    at org.apache.zookeeper.ZooKeeper.setData(ZooKeeper.java:1270)
    at org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.setData(RecoverableZooKeeper.java:429)
    at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:940)
    at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:990)
    at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:984)
    at org.apache.hadoop.hbase.replication.ReplicationQueuesZKImpl.setLogPosition(ReplicationQueuesZKImpl.java:129)
    at org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager.logPositionAndCleanOldLogs(ReplicationSourceManager.java:177)
    at org.apache.hadoop.hbase.replication.regionserver.ReplicationSource.run(ReplicationSource.java:388)
{code}
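For anyone trying to reproduce this, the WAL provider setting named above would
look roughly like the following in hbase-site.xml. This is a minimal sketch:
only hbase.wal.provider=multiwal comes from this report. The rest of the
cluster configuration, and how the replication peer was added, are not shown
and are assumed to be whatever the affected cluster already uses.
{code:xml}
<!-- hbase-site.xml fragment (sketch): the only setting taken from this report -->
<configuration>
  <property>
    <name>hbase.wal.provider</name>
    <value>multiwal</value>
  </property>
  <!-- Assumption: replication is enabled and at least one peer exists;
       those settings are not part of this report and are omitted here. -->
</configuration>
{code}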
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)