Benoit Sigoure created HBASE-26383:
--------------------------------------

             Summary: HBCK incorrectly reports inconsistencies for recently 
split regions following a master failover
                 Key: HBASE-26383
                 URL: https://issues.apache.org/jira/browse/HBASE-26383
             Project: HBase
          Issue Type: Bug
          Components: master
    Affects Versions: 2.4.3
            Reporter: Benoit Sigoure


When a region P splits into A and B, following a master failover the newly 
active master reports that P is in an inconsistent state. This seems to be a 
regression introduced in HBASE-25847 (cc [[email protected]]) which 
changed {{regionInfo.isParentSplit()}} to {{regionState.isSplit()}}. The region 
state after restart is CLOSED (rather than SPLIT), so both region state and 
region info should be checked, presumably with {{regionState.isSplit() || 
regionInfo.isSplit()}}. This situation resolves itself on its own when a major 
compaction occurs and P is GCed, but having the master incorrectly report 
inconsistencies is pretty bad. We had a pretty big outage due to a series of 
operator errors as our SRE team was trying to fix this inconsistency that, in 
fact, didn't even exist.

Thanks to Stack for helping look over this issue and Vlad Hanciuta for root 
causing the bug.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to