[ https://issues.apache.org/jira/browse/HBASE-3331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13082090#comment-13082090 ]
Ming Ma commented on HBASE-3331:
--------------------------------
I tested this a couple of times on trunk and couldn't reproduce it. The system
recovers within a couple of minutes.
> Kill -STOP of RS hosting META does not recover
> ----------------------------------------------
>
> Key: HBASE-3331
> URL: https://issues.apache.org/jira/browse/HBASE-3331
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.0
> Reporter: Todd Lipcon
> Priority: Critical
> Fix For: 0.92.0
>
> Attachments: timeouts.log.txt
>
>
> If you find the server hosting META and kill -STOP its region server, it will
> eventually lose its ZK session and the master will split its logs and try to
> reassign. However, at some point along here it tries to access the old META,
> and gets SocketTimeoutExceptions, which cause it to keep retrying forever.
> Once I kill -9ed the stopped server, things came back to life.
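
The difference between kill -STOP and kill -9 described above can be illustrated outside HBase. The sketch below is not HBase code; it is a minimal stand-in that assumes a POSIX system and uses a hypothetical toy TCP server in a child process. While the server is stopped, the kernel still accepts connections on its listening socket, so a client sees a read timeout rather than a refused connection. After SIGKILL the socket is closed and the client fails fast, which is probably why things recovered only once the stopped server was killed.

```python
import signal
import socket
import subprocess
import sys

# Hypothetical toy server standing in for the region server: it answers
# every connection with b"pong" and prints its port on startup.
SERVER = r"""
import socket
srv = socket.socket()
srv.bind(("127.0.0.1", 0))
srv.listen(8)
print(srv.getsockname()[1], flush=True)
while True:
    conn, _ = srv.accept()
    conn.recv(16)
    conn.sendall(b"pong")
    conn.close()
"""


def probe(port, timeout=1.0):
    """Send one request and classify what the client observes."""
    try:
        with socket.create_connection(("127.0.0.1", port), timeout=timeout) as s:
            s.sendall(b"ping")
            return s.recv(16).decode()
    except socket.timeout:
        return "timeout"
    except ConnectionRefusedError:
        return "refused"


def demo():
    child = subprocess.Popen(
        [sys.executable, "-c", SERVER], stdout=subprocess.PIPE, text=True
    )
    try:
        port = int(child.stdout.readline())
        results = [probe(port)]            # healthy server
        child.send_signal(signal.SIGSTOP)  # same effect as kill -STOP
        results.append(probe(port))        # connect succeeds, read hangs
        child.kill()                       # same effect as kill -9
        child.wait()
        results.append(probe(port))        # listening socket is gone
        return results
    finally:
        if child.poll() is None:
            child.kill()
        child.wait()


if __name__ == "__main__":
    print(demo())
```

On Linux the three probes are expected to report a normal reply, then a timeout, then a refused connection. A client that retries on timeouts without an upper bound would stay stuck in the second state for as long as the process remains stopped.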