[
https://issues.apache.org/jira/browse/HBASE-24245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126861#comment-17126861
]
Alex Batyrshin commented on HBASE-24245:
----------------------------------------
Got the same issue at hbase-1.4.12. Can't find anything strage at zookeeper
logs
{{Jun 04 21:13:04 hbase01 hbase[54094]: 2020-06-04 21:13:04,378 FATAL
[main-EventThread.replicationSource,kafkaproxy.replicationSource.hbase01.intra%2C60020%2C1591106833218.regiongroup-1,kafkaproxy]
regionserver.HRegionServer: ABORTING region server
hbase01.intra,60020,1591106833218: Failed to write replication wal position
(filename=hbase01.intra%2C60020%2C1591106833218.regiongroup-1.1591293776494,
position=115406460)}}
{{Jun 04 21:13:04 hbase01 hbase[54094]:
org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode
for
/hbase/replication/rs/hbase01.intra,60020,1591106833218/kafkaproxy/hbase01.intra%2C60020%2C1591106833218.regiongroup-1.1591293776494}}
{{Jun 04 21:13:04 hbase01 hbase[54094]: at
org.apache.zookeeper.KeeperException.create(KeeperException.java:111)}}
{{Jun 04 21:13:04 hbase01 hbase[54094]: at
org.apache.zookeeper.KeeperException.create(KeeperException.java:51)}}
{{Jun 04 21:13:04 hbase01 hbase[54094]: at
org.apache.zookeeper.ZooKeeper.setData(ZooKeeper.java:1327)}}
{{Jun 04 21:13:04 hbase01 hbase[54094]: at
org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.setData(RecoverableZooKeeper.java:422)}}
{{Jun 04 21:13:04 hbase01 hbase[54094]: at
org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:824)}}
{{Jun 04 21:13:04 hbase01 hbase[54094]: at
org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:874)}}
{{Jun 04 21:13:04 hbase01 hbase[54094]: at
org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:868)}}
{{Jun 04 21:13:04 hbase01 hbase[54094]: at
org.apache.hadoop.hbase.replication.ReplicationQueuesZKImpl.setLogPosition(ReplicationQueuesZKImpl.java:155)}}
{{Jun 04 21:13:04 hbase01 hbase[54094]: at
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager.logPositionAndCleanOldLogs(ReplicationSourceManager.java:194)}}
{{Jun 04 21:13:04 hbase01 hbase[54094]: at
org.apache.hadoop.hbase.replication.regionserver.ReplicationSource$ReplicationSourceShipperThread.updateLogPosition(ReplicationSource.java:731)}}
{{Jun 04 21:13:04 hbase01 hbase[54094]: at
org.apache.hadoop.hbase.replication.regionserver.ReplicationSource$ReplicationSourceShipperThread.shipEdits(ReplicationSource.java:702)}}
{{Jun 04 21:13:04 hbase01 hbase[54094]: at
org.apache.hadoop.hbase.replication.regionserver.ReplicationSource$ReplicationSourceShipperThread.run(ReplicationSource.java:555)}}
> Failed to write replication wal position
> ----------------------------------------
>
> Key: HBASE-24245
> URL: https://issues.apache.org/jira/browse/HBASE-24245
> Project: HBase
> Issue Type: Bug
> Components: Replication
> Affects Versions: 1.5.0
> Environment: apache-phoenix-4.15.0-HBase-1.5
> hadoop-2.10.0
> hbase-1.5.0-bin
> zookeeper-3.4.14
> Reporter: luodx
> Priority: Critical
> Fix For: 1.4.10
>
>
> I turned on replication for tables, index tables did not, wrote data using
> phoneix, and hbase regionserver went down after running for a while.
> regionserver log:
> 2020-04-20 19:22:49,531 FATAL
> [main-EventThread.replicationSource,1.replicationSource.a2-test-regionserver-52-23.sh%2C16020%2C1587378758523,1]
> regionserver.HRegionServe
> r: ABORTING region server a2-
> test-regionserver-52-23.sh,16020,1587378758523: Failed to write replication
> wal position (filename=a2-test-regionserver-52-23.sh%2C16020%2C
> 1587378758523.1587381739689, position=3388929)
> org.apache.zookeeper. KeeperException$NoNodeException: KeeperErrorCode =
> NoNode for
> /hbase/replication/rs/a2-test-regionserver-52-23.sh,16020,1587378758523/1/a2-test-reg
>
> ionserver-52-23.sh%2C16020%2C1587378758523. 1587381739689
> at org.apache.zookeeper.KeeperException.create(KeeperException.java:111)
> at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
> at org.apache.zookeeper.ZooKeeper.setData(ZooKeeper.java:1327)
> at
> org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.setData(RecoverableZooKeeper.java:510)
>
> at org.apache.hadoop.hbase.zookeeper.ZKUtil .setData(ZKUtil.java:867)
> at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:917)
> at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:911 )
> at
> org.apache.hadoop.hbase.replication.ReplicationQueuesZKImpl.setLogPosition(ReplicationQueuesZKImpl.java:155)
>
> at
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager.logPositionAndCleanOldLogs(ReplicationSourceManager.java:197)
> at
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSource$ReplicationSourceShipperThread.updateLogPosition(ReplicationSource.java:791)
>
> at
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSource$ReplicationSourceShipperThread.shipEdits(ReplicationSource.java:762)
>
> at
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSource$ReplicationSourceShipperThread.run(ReplicationSource.java:601)
--
This message was sent by Atlassian Jira
(v8.3.4#803005)