HRegionServer.checkFileSystem() should only abort() after fs is down for some time ----------------------------------------------------------------------------------
Key: HBASE-5668 URL: https://issues.apache.org/jira/browse/HBASE-5668 Project: HBase Issue Type: Improvement Reporter: Prakash Khemani When checkFileSystem() fails then the region server should wait for sometime before aborting. By default, the timeout can be same as zookeeper session timeout. When say a rack switch reboots or fails for a few minutes, and all the traffic to the region server dies ... then we don't want the region servers to unnecessarily kill themselves when ongoing compactions or flushes fail. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira