Benoit Sigoure created HBASE-26383:
--------------------------------------
Summary: HBCK incorrectly reports inconsistencies for recently
split regions following a master failover
Key: HBASE-26383
URL: https://issues.apache.org/jira/browse/HBASE-26383
Project: HBase
Issue Type: Bug
Components: master
Affects Versions: 2.4.3
Reporter: Benoit Sigoure
When a region P splits into A and B and a master failover follows, the newly
active master reports that P is in an inconsistent state. This seems to be a
regression introduced in HBASE-25847 (cc [[email protected]]) which
changed {{regionInfo.isParentSplit()}} to {{regionState.isSplit()}}. The region
state after restart is CLOSED (rather than SPLIT), so both region state and
region info should be checked, presumably with {{regionState.isSplit() ||
regionInfo.isSplit()}}. This situation resolves itself when a major
compaction occurs and P is GCed, but having the master incorrectly report
inconsistencies is pretty bad. We had a pretty big outage due to a series of
operator errors as our SRE team was trying to fix this inconsistency that, in
fact, didn't even exist.
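
To illustrate the proposed check, here is a minimal, self-contained sketch. It does not use the real HBase classes: {{RegionInfoStub}}, {{RegionStateStub}}, {{State}} and {{isSplitParent}} are hypothetical stand-ins invented for this example, and only the {{isSplit()}} method names mirror the calls mentioned above. It assumes, as described, that after a failover the parent's region state is CLOSED while its region info still carries the split flag.

```java
// Illustrative sketch only: these stub types are NOT the real HBase
// RegionInfo / RegionState classes, just simplified stand-ins.
public class SplitParentCheck {

    // Hypothetical subset of region states relevant to this issue.
    enum State { OPEN, CLOSED, SPLIT }

    // Stand-in for the persisted region info, which keeps the split flag.
    static final class RegionInfoStub {
        private final boolean split;
        RegionInfoStub(boolean split) { this.split = split; }
        boolean isSplit() { return split; }
    }

    // Stand-in for the master's in-memory region state.
    static final class RegionStateStub {
        private final State state;
        RegionStateStub(State state) { this.state = state; }
        boolean isSplit() { return state == State.SPLIT; }
    }

    // Proposed check: treat the region as a split parent if either the
    // region state or the region info says it has been split.
    static boolean isSplitParent(RegionStateStub state, RegionInfoStub info) {
        return state.isSplit() || info.isSplit();
    }

    public static void main(String[] args) {
        // Parent P as assumed after a master failover: state is CLOSED,
        // but the region info still records the split.
        RegionStateStub stateAfterFailover = new RegionStateStub(State.CLOSED);
        RegionInfoStub infoOfP = new RegionInfoStub(true);

        System.out.println("state-only check:  " + stateAfterFailover.isSplit());
        System.out.println("combined check:    "
            + isSplitParent(stateAfterFailover, infoOfP));
    }
}
```

With the state-only check, P is not recognized as a split parent in this scenario, which is what leads to the spurious inconsistency report; the combined check recognizes it in both the pre-failover (SPLIT) and post-failover (CLOSED) cases.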
Thanks to Stack for helping look over this issue and Vlad Hanciuta for
root-causing the bug.