[ 
https://issues.apache.org/jira/browse/HBASE-21627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16727770#comment-16727770
 ] 

Duo Zhang commented on HBASE-21627:
-----------------------------------

AFAICT this could happen...
I haven't treated the logic for assigning meta replicas very carefully as in 
our internal deploy we do not enable this feature...

Anyway I think we should treat more seriously now...

> race condition between a recovered RIT for meta replica, and master startup
> ---------------------------------------------------------------------------
>
>                 Key: HBASE-21627
>                 URL: https://issues.apache.org/jira/browse/HBASE-21627
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Sergey Shelukhin
>            Priority: Major
>
> Master recovers RIT for a meta replica
> {noformat}
> 2018-12-14 23:16:12,008 INFO  [master/...:17000:becomeActiveMaster] 
> assignment.AssignmentManager: Attach pid=83796, ppid=83788, 
> state=RUNNABLE:REGION_STATE_TRANSITION_OPEN, hasLock=false; 
> TransitRegionStateProcedure table=hbase:meta, region=(region), ASSIGN to 
> rit=OFFLINE, location=null, table=hbase:meta, region=(region) to restore RIT
> 2018-12-14 23:16:16,475 WARN  [PEWorker-8] 
> assignment.TransitRegionStateProcedure: No location specified for {ENCODED => 
> (region), NAME => 'hbase:meta,,1_0001', STARTKEY => '', ENDKEY => '', 
> REPLICA_ID => 1}, jump back to state 
> REGION_STATE_TRANSITION_GET_ASSIGN_CANDIDATE to get one
> ...
> 2018-12-14 23:16:30,010 INFO  [PEWorker-16] procedure2.ProcedureExecutor: 
> Finished pid=83796, ppid=83788, state=SUCCESS, hasLock=false; 
> TransitRegionStateProcedure table=hbase:meta, region=(region), ASSIGN in 
> 8mins, 23.39sec
> {noformat}
> Then tries to assign replicas..
> {noformat}
> 2018-12-14 23:16:36,091 ERROR [master/...:17000:becomeActiveMaster] 
> master.HMaster: Failed to become active master
> org.apache.hadoop.hbase.client.DoNotRetryRegionException: Unexpected state 
> for rit=OPEN, location=server,17020,1544858156805, table=hbase:meta, 
> region=(region)
>                 at 
> org.apache.hadoop.hbase.master.assignment.AssignmentManager.preTransitCheck(AssignmentManager.java:548)
>                 at 
> org.apache.hadoop.hbase.master.assignment.AssignmentManager.assign(AssignmentManager.java:563)
>                 at 
> org.apache.hadoop.hbase.master.MasterMetaBootstrap.assignMetaReplicas(MasterMetaBootstrap.java:84)
>                 at 
> org.apache.hadoop.hbase.master.HMaster.finishActiveMasterInitialization(HMaster.java:1146)
> {noformat}
> Unfortunately I misplaced the log from this after copy-pasting a grep result 
> so that's all I have for this.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to