[ 
https://issues.apache.org/jira/browse/HBASE-27763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

guoxiaojiao updated HBASE-27763:
--------------------------------
    Description: 
20696 2022-07-07 14:51:52,638 ERROR.HRegionServer: ***** ABORTING region server 
***: Failed to operate on replication queue *****
120697 org.apache.hadoop.hbase.replication.ReplicationException: Failed to set 
log position (serverName=***, 
queueId=***.regiongroup-0.1657176199216,position=55122787)
120698     at 
org.apache.hadoop.hbase.replication.ZKReplicationQueueStorage.setWALPosition(ZKReplicationQueueStorage.java:261)
120699     at 
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager.lambda$logPositionAndCleanOldLogs$7(ReplicationSourceManager.java:529)
120700     at 
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager.interruptOrAbortWhenFail(ReplicationSourceManager.java:476)
120701     at 
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager.logPositionAndCleanOldLogs(ReplicationSourceManager.java:528)
120702     at 
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceShipper.updateLogPosition(ReplicationSourceShipper.java:271)
120703     at 
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceShipper.shipEdits(ReplicationSourceShipper.java:205)
120704     at 
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceShipper.run(ReplicationSourceShipper.java:120)
120705 Caused by: org.apache.zookeeper.KeeperException$NoNodeException: 
KeeperErrorCode = NoNode
120706     at 
org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
120707     at org.apache.zookeeper.ZooKeeper.multiInternal(ZooKeeper.java:1925)
120708     at org.apache.zookeeper.ZooKeeper.multi(ZooKeeper.java:1830)
120709     at 
org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.multi(RecoverableZooKeeper.java:668)
120710     at 
org.apache.hadoop.hbase.zookeeper.ZKUtil.multiOrSequential(ZKUtil.java:1797)
120711     at 
org.apache.hadoop.hbase.replication.ZKReplicationQueueStorage.setWALPosition(ZKReplicationQueueStorage.java:251)
120712     ... 6 more

> Recover WAL encounter  KeeperErrorCode = NoNode cause RegionServer crash
> ------------------------------------------------------------------------
>
>                 Key: HBASE-27763
>                 URL: https://issues.apache.org/jira/browse/HBASE-27763
>             Project: HBase
>          Issue Type: Bug
>            Reporter: guoxiaojiao
>            Priority: Major
>
> 20696 2022-07-07 14:51:52,638 ERROR.HRegionServer: ***** ABORTING region 
> server ***: Failed to operate on replication queue *****
> 120697 org.apache.hadoop.hbase.replication.ReplicationException: Failed to 
> set log position (serverName=***, 
> queueId=***.regiongroup-0.1657176199216,position=55122787)
> 120698     at 
> org.apache.hadoop.hbase.replication.ZKReplicationQueueStorage.setWALPosition(ZKReplicationQueueStorage.java:261)
> 120699     at 
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager.lambda$logPositionAndCleanOldLogs$7(ReplicationSourceManager.java:529)
> 120700     at 
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager.interruptOrAbortWhenFail(ReplicationSourceManager.java:476)
> 120701     at 
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager.logPositionAndCleanOldLogs(ReplicationSourceManager.java:528)
> 120702     at 
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceShipper.updateLogPosition(ReplicationSourceShipper.java:271)
> 120703     at 
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceShipper.shipEdits(ReplicationSourceShipper.java:205)
> 120704     at 
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceShipper.run(ReplicationSourceShipper.java:120)
> 120705 Caused by: org.apache.zookeeper.KeeperException$NoNodeException: 
> KeeperErrorCode = NoNode
> 120706     at 
> org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
> 120707     at 
> org.apache.zookeeper.ZooKeeper.multiInternal(ZooKeeper.java:1925)
> 120708     at org.apache.zookeeper.ZooKeeper.multi(ZooKeeper.java:1830)
> 120709     at 
> org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.multi(RecoverableZooKeeper.java:668)
> 120710     at 
> org.apache.hadoop.hbase.zookeeper.ZKUtil.multiOrSequential(ZKUtil.java:1797)
> 120711     at 
> org.apache.hadoop.hbase.replication.ZKReplicationQueueStorage.setWALPosition(ZKReplicationQueueStorage.java:251)
> 120712     ... 6 more



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to