Hua Liu created HDFS-9901: ----------------------------- Summary: Move block validation out of the heartbeat thread Key: HDFS-9901 URL: https://issues.apache.org/jira/browse/HDFS-9901 Project: Hadoop HDFS Issue Type: Improvement Components: datanode Reporter: Hua Liu Assignee: Hua Liu Priority: Minor
During heavy disk IO, we noticed hearbeat thread hangs on checkBlock method, which checks the existence and length of a block before spins off a thread to do the actual transferring. In extreme cases, the heartbeat thread hang more than 10 minutes so the namenode marked the datanode as dead and started replicating its blocks, which caused more disk IO on other nodes and can potentially brought them down. -- This message was sent by Atlassian JIRA (v6.3.4#6332)