Chackaravarthy created HDFS-10365:
-------------------------------------
Summary: FullBlockReports retransmission delays NN startup time in
large cluster.
Key: HDFS-10365
URL: https://issues.apache.org/jira/browse/HDFS-10365
Project: Hadoop HDFS
Issue Type: Bug
Components: hdfs
Affects Versions: 2.6.0
Environment: version - hadoop-2.6.0
DN - 1200 nodes
Reporter: Chackaravarthy
Priority: Critical
Whenever NN is restarted, it takes huge time for NN to come back to stable
state. i.e. Last contact time remains more than 1 or 2 mins continuously for
around 3 to 4 hours. This is mainly because most of the DN's getting timeout
(60s) in blockReport (FBR) rpc call and then it keep sending FBR again.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]