[
https://issues.apache.org/jira/browse/HBASE-21745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16888302#comment-16888302
]
stack commented on HBASE-21745:
-------------------------------
On...
bq. Fix assignment errors (undeployed regions, double assignments (yes, should
not be possible), etc)
HBASE-22527 adds display on master UI of 'problematic regions' which are one of
the following:
* Master thought this region opened, but no regionserver reported it.
* Master thought this region opened on Server1, but regionserver reported
Server2
* More than one regionservers reported opened this region
All above should be fixable with HBCK2 currently; what combination depends on
the particular problem. For example, HBASE-22527 has case #1 above where a meta
had a region assigned to a server no longer a member of the cluster (for
whatever reason...). A recipe in HBASE-22527 shows one fix (I think there a
more compact solution but in the heat of the moment... whatever works). For #2
and #3, Master used to tell disagreeing regionserver to kill itself because it
was in disagreement with the Master's view of the world (but I think this
killing was later undone).
I think 'Fix assignment errors (undeployed regions, double assignments (yes,
should not be possible), etc)' covered. Let me strike it out in the list above.
> Make HBCK2 be able to fix issues other than region assignment
> -------------------------------------------------------------
>
> Key: HBASE-21745
> URL: https://issues.apache.org/jira/browse/HBASE-21745
> Project: HBase
> Issue Type: Umbrella
> Components: hbase-operator-tools, hbck2
> Reporter: Duo Zhang
> Assignee: stack
> Priority: Critical
>
> This is what [~apurtell] posted on mailing-list, HBCK2 should support
> * -Rebuild meta from region metadata in the filesystem, aka offline meta
> rebuild.-
> * Fix assignment errors (undeployed regions, double assignments (yes, should
> not be possible), etc)
> * Fix region holes, overlaps, and other errors in the region chain
> * Fix failed split and merge transactions that have failed to roll back due
> to some bug (related to previous)
> * -Enumerate store files to determine file level corruption and sideline
> corrupt files-
> * -Fix hfile link problems (dangling / broken)-
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)