[
https://issues.apache.org/jira/browse/HBASE-2575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12869412#action_12869412
]
Benoit Sigoure commented on HBASE-2575:
---------------------------------------
I believe we ran into a similar problem at StumbleUpon where the filesystem of
one of the region servers sort of got into a wedged state. Any idea as to what
could be causing this or how to fix it? Any idea on how to reproduce the
problem easily (that is without physically pulling harddrives out)?
> Fault scenario of dead root drive on RS causes cluster lockup
> -------------------------------------------------------------
>
> Key: HBASE-2575
> URL: https://issues.apache.org/jira/browse/HBASE-2575
> Project: Hadoop HBase
> Issue Type: Bug
> Components: regionserver
> Affects Versions: 0.21.0
> Reporter: Todd Lipcon
> Priority: Critical
>
> We performed a fault test where we physically pulled the root drive out of a
> machine while it was on. The regionserver continued to run fine with existing
> clients. But any new clients that tried to connect to it for RPC would not
> work correctly. So when I started a new client, that client made no progress.
> Despite this, the RS continued to happily heartbeat to the master, so the
> master did not remove it from the cluster. Note that in this case, we were
> logging to NFS, and the logs continued to write, but no exceptions shown.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.