He Liangliang created HBASE-13200: ------------------------------------- Summary: Improper configuration can leads to endless lease recovery during failover Key: HBASE-13200 URL: https://issues.apache.org/jira/browse/HBASE-13200 Project: HBase Issue Type: Bug Components: MTTR Reporter: He Liangliang Assignee: He Liangliang
When a node (DN+RS) has machine/OS level failure, another RS will try to do lease recovery for the log file. It will retry for every hbase.lease.recovery.dfs.timeout (default to 61s) from the second time. When the hdfs configuration is not properly configured (e.g. socket connection timeout) and without patch HDFS-4721, the lease recovery time can exceeded the timeout specified by hbase.lease.recovery.dfs.timeout. This will lead to endless retries and preemptions until the final timeout. -- This message was sent by Atlassian JIRA (v6.3.4#6332)