[
https://issues.apache.org/jira/browse/HBASE-14699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14977852#comment-14977852
]
Yu Li commented on HBASE-14699:
-------------------------------
Hi [~ashu210890],
The issue in ReplicationSourceManager#cleanOldLogs was already found and addressed in
HBASE-6617 (refer to [this comment|https://issues.apache.org/jira/browse/HBASE-6617?focusedCommentId=14708924&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14708924]).
HBASE-6617 has been integrated into branch-1 and master, but not branch-1.2.
I believe branch-1 doesn't have this issue; you could give it a try if possible.
[~busbey], feel free to let me know if you would like to take HBASE-6617 into
branch-1.2. I could make a quick patch, although there may be some rebase work
:-)
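As a rough illustration of how a cleanup of this kind can go wrong with multiwal, here is a minimal sketch. It is not the actual HBase code: the class name, the cleanOlderThan helper, and the shortened WAL names are hypothetical, and it assumes that WALs from all groups are tracked in one name-sorted set and that cleanup removes everything sorting before the file just processed.

```java
import java.util.SortedSet;
import java.util.TreeSet;

// Hypothetical model, not HBase code: one sorted set holds WAL names from
// every WAL group, and cleanup drops all names that sort before a given key.
public class MultiWalCleanupSketch {

    // Removes every entry that sorts before `key` and returns what was removed.
    static SortedSet<String> cleanOlderThan(SortedSet<String> wals, String key) {
        SortedSet<String> removed = new TreeSet<>(wals.headSet(key));
        wals.headSet(key).clear(); // headSet is a live view, so this edits `wals`
        return removed;
    }

    public static void main(String[] args) {
        SortedSet<String> wals = new TreeSet<>();
        // Two WAL groups (null0 and null1), each with one file still in use.
        wals.add("rs.null0.1445495898373");
        wals.add("rs.null1.1445495898400");

        // The reader for group null1 records its position and triggers cleanup.
        SortedSet<String> removed = cleanOlderThan(wals, "rs.null1.1445495898400");

        // The null0 file sorts first by name, so it is removed although the
        // null0 reader may still need it.
        System.out.println("removed: " + removed);
        System.out.println("null0 still tracked: "
            + wals.contains("rs.null0.1445495898373"));
    }
}
```

Under these assumptions, a later position update for the null0 file would target an entry that no longer exists, which would be consistent with the NoNode error in the report below.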
> Replication crashes regionservers when hbase.wal.provider is set to multiwal
> ----------------------------------------------------------------------------
>
> Key: HBASE-14699
> URL: https://issues.apache.org/jira/browse/HBASE-14699
> Project: HBase
> Issue Type: Bug
> Components: Replication
> Reporter: Ashu Pachauri
> Assignee: Ashu Pachauri
> Priority: Blocker
>
> When the hbase.wal.provider is set to multiwal and replication is enabled,
> the regionservers start crashing with the following exception:
> {code}
> <hostname>,16020,1445495411258: Failed to write replication wal position (filename=<hostname>%2C16020%2C1445495411258.null0.1445495898373, position=1322399)
> org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /hbase/replication/rs/<hostname>,16020,1445495411258/1/<hostname>%2C16020%2C1445495411258.null0.1445495898373
> at org.apache.zookeeper.KeeperException.create(KeeperException.java:111)
> at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
> at org.apache.zookeeper.ZooKeeper.setData(ZooKeeper.java:1270)
> at org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.setData(RecoverableZooKeeper.java:429)
> at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:940)
> at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:990)
> at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:984)
> at org.apache.hadoop.hbase.replication.ReplicationQueuesZKImpl.setLogPosition(ReplicationQueuesZKImpl.java:129)
> at org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager.logPositionAndCleanOldLogs(ReplicationSourceManager.java:177)
> at org.apache.hadoop.hbase.replication.regionserver.ReplicationSource.run(ReplicationSource.java:388)
> {code}