[ 
https://issues.apache.org/jira/browse/HBASE-22657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17364655#comment-17364655
 ] 

kangTwang commented on HBASE-22657:
-----------------------------------

Hi:

    I also have this problem here. Have you solved it now??

> HBase : STUCK Region-In-Transition 
> -----------------------------------
>
>                 Key: HBASE-22657
>                 URL: https://issues.apache.org/jira/browse/HBASE-22657
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 2.0.0
>            Reporter: oktay tuncay
>            Priority: Critical
>
> When we check the number of regions in transition on Ambari, It shows 1 
> transition is waiting. (It's more than 1 in other cluster)
> And also, when check the table with command "hbase hbck -details 
> *table_name*" status looks INCONSISTENT
> _There are 0 overlap groups with 0 overlapping regions
> ERROR: Found inconsistency in table *Table_Name*
> Summary:
> Table hbase:meta is okay.
> Number of regions: 1
> Deployed on: hostname1:port, hostname2:port, hostname3:port, hostname4:port
> Table *Table_Name *is okay.
> Number of regions: 39
> Deployed on: hostname1:port, hostname2:port, hostname3:port, hostname4:port
> 2 inconsistencies detected.
> Status: *INCONSISTENT*
> When I check the logfiles, I saw following warning messages,
> 2019-06-09T07:14:15.179+02:00 WARN 
> org.apache.hadoop.hbase.master.assignment.AssignmentManager: STUCK 
> Region-In-Transition rit=CLOSING, location=*hostname*,*port*,1558699727048, 
> table=*table_name*, region=c67dd5d8bcd174cc2001695c31475ab1
> According this message, region c67dd5d8bcd174cc2001695c31475ab1 try to assign 
> *host* but this operation is stuck.
> We stopped RS process on *host* and force assign to another RS which are 
> running.
> *hbase(main):001:0> assign 'c67dd5d8bcd174cc2001695c31475ab1'*
> After that operaion, INCONSISTENT has gone and we re-started RS on host.
> One of the reasons why a region gets stuck in transition is because, when it 
> is being moved across regionservers, it is unassigned from the source 
> regionserver but is never assigned to another regionserver
> I think Below code is responsible for that process. 
> private void handleRegionOverStuckWarningThreshold(final RegionInfo 
> regionInfo) {
> final RegionStateNode regionNode = 
> regionStates.getRegionStateNode(regionInfo);
> //if (regionNode.isStuck()) {
> LOG.warn("STUCK Region-In-Transition {}", regionNode);_
> It seems one potential way of unstuck the region is to send close request to 
> the region server. May be blocked because another Procedure holds the 
> exclusive lock and is not letting go.
> My question is what is the root cause for this problem and I think, HBase 
> should be able to fix region-In-Transition issue.
> We can fix this problem by manual but some customer does not have this 
> knowledge and I think HBase needs to be recover itself.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to