[
https://issues.apache.org/jira/browse/HBASE-4124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13090698#comment-13090698
]
gaojinchao commented on HBASE-4124:
-----------------------------------
@ram
How come we have a dead RS if we dont kill the RS
gao: If you stop the cluster, The meta will handle the server information.
if the master is also killed how can the regions be assigned to some other RS
gao: When master startup, it collects the regions on a same region server and
call sendRegionOpen(destination, regions).
If the region is relatively large number, when region server opens the
reigons needs a long time.
when master crash, the new master may reopen the regions on another region
server.
> ZK restarted while assigning a region, new active HM re-assign it but the RS
> warned 'already online on this server'.
> --------------------------------------------------------------------------------------------------------------------
>
> Key: HBASE-4124
> URL: https://issues.apache.org/jira/browse/HBASE-4124
> Project: HBase
> Issue Type: Bug
> Components: master
> Reporter: fulin wang
> Assignee: gaojinchao
> Fix For: 0.90.5
>
> Attachments: HBASE-4124_Branch90V1_trial.patch,
> HBASE-4124_Branch90V2.patch, HBASE-4124_Branch90V3.patch, log.txt
>
> Original Estimate: 0.4h
> Remaining Estimate: 0.4h
>
> ZK restarted while assigning a region, new active HM re-assign it but the RS
> warned 'already online on this server'.
> Issue:
> The RS failed besause of 'already online on this server' and return; The HM
> can not receive the message and report 'Regions in transition timed out'.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira