[
https://issues.apache.org/jira/browse/HBASE-22631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16905726#comment-16905726
]
yuhuiyang edited comment on HBASE-22631 at 8/13/19 1:56 AM:
------------------------------------------------------------
[~wchevreuil] Thanks for your attention .
I am sorry the rs log is lost for the reinstallation of the physical machine.
As shown in the picture above the rs show the missing table descriptor
exception message .
I think it is different from link title . The incorrect message is saved in
master memory structure serverMap and if we restart master before tempt20
crashes , then when tempt20 crashes the gced region will not appear . Because
it will load the meta table when master restarts . The meta infomation is
correct for this region . So it will create a correct serverMap according to
meta.
was (Author: yu-huiyang):
[~wchevreuil] Thanks for your attention .I think it is different from [link
title|https://issues.apache.org/jira/browse/HBASE-21843] . The incorrect
message is saved in master memory structure serverMap and if we restart master
before tempt20 crashes , then when tempt20 crashes the gced region will not
appear . Because it will load the meta table when master restarts . The meta
infomation is correct for this region . So it will create a correct serverMap
according to meta.
> assign failed may make gced parent region appear again !!!
> ----------------------------------------------------------
>
> Key: HBASE-22631
> URL: https://issues.apache.org/jira/browse/HBASE-22631
> Project: HBase
> Issue Type: Bug
> Components: proc-v2
> Affects Versions: 2.1.1
> Reporter: yuhuiyang
> Priority: Major
> Attachments: HBASE-22631-branch-2.1-01.patch, assign.png,
> assignProcedure.txt, serverCrash.png, splitAndGc.png
>
>
> When i assign a region A the process is as follows:
> step1 : A is assigned to rs1 , and rs1 fails to open it .
> step2 : assignProcedure handleFailure .
> step3 : A is assign to rs2 and rs success to open it .
> Above is the normal flow . However when rs1 is restart after the reigon A was
> split and GCRegionProcedure was successed , the region A appare again !
> The region is that reigon A is not removed from the serverMap correctly when
> assignprocedure handleFailure . Because the code regionNode.offline() make
> the regionNode's regionLocation to be null and make regionNode's state to
> OFFLINE . So when the code
> env.getAssignmentManager().undoRegionAsOpening(regionNode) do nothing . So
> when the rs1 restart event triggers a serverCrashProcedure, it will get
> reigons from serverMap and it will get the region A then A will be assigned
> and hdfs dir will be created.
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)