Todd Lipcon created HDFS-4176: --------------------------------- Summary: EditLogTailer should call rollEdits with a timeout Key: HDFS-4176 URL: https://issues.apache.org/jira/browse/HDFS-4176 Project: Hadoop HDFS Issue Type: Bug Components: ha, name-node Affects Versions: 2.0.2-alpha, 3.0.0 Reporter: Todd Lipcon
When the EditLogTailer thread calls rollEdits() on the active NN via RPC, it currently does so without a timeout. So, if the active NN has frozen (but not actually crashed), this call can hang forever. This can then potentially prevent the standby from becoming active. This may actually considered a side effect of HADOOP-6762 -- if the RPC were interruptible, that would also fix the issue. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira