Sun Xin created HBASE-27476:
-------------------------------
Summary: Recovered replication may be blocked if enabled
hbase.separate.oldlogdir.by.regionserver
Key: HBASE-27476
URL: https://issues.apache.org/jira/browse/HBASE-27476
Project: HBase
Issue Type: Bug
Components: Replication
Affects Versions: 2.4.15, 3.0.0-alpha-3
Reporter: Sun Xin
Assignee: Sun Xin
In other PR, I got a failed UT
{code:java}
[ERROR] Failures:
[ERROR]
org.apache.hadoop.hbase.replication.TestReplicationKillMasterRSWithSeparateOldWALs.killOneMasterRS
[ERROR] Run 1:
TestReplicationKillMasterRSWithSeparateOldWALs>TestReplicationKillMasterRS.killOneMasterRS:47->TestReplicationKillRS.loadTableAndKillRS:84
Waited too much time for queueFailover replication. Waited 61065ms.
[ERROR] Run 2:
TestReplicationKillMasterRSWithSeparateOldWALs>TestReplicationKillMasterRS.killOneMasterRS:47->TestReplicationKillRS.loadTableAndKillRS:84
Waited too much time for queueFailover replication. Waited 58864ms.
[ERROR] Run 3:
TestReplicationKillMasterRSWithSeparateOldWALs>TestReplicationKillMasterRS.killOneMasterRS:47->TestReplicationKillRS.loadTableAndKillRS:84
Waited too much time for queueFailover replication. Waited 57103ms. {code}
This should be caused by a bug.
If enabled {_}hbase.separate.oldlogdir.by.regionserver{_}, old wals will be
moved into different dir by regionserver name like root/oldWALs/server1/wal1 .
For recovered replication, can't convert wal path(like root/oldWALs/wal1) into
such paths, and throws FileNotFoundException.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)