Andrew Wang created HDFS-4816: --------------------------------- Summary: transitionToActive blocks if the SBN is doing checkpoint image transfer Key: HDFS-4816 URL: https://issues.apache.org/jira/browse/HDFS-4816 Project: Hadoop HDFS Issue Type: Bug Components: namenode Affects Versions: 2.0.4-alpha, 3.0.0 Reporter: Andrew Wang
The NN and SBN do this dance during checkpoint image transfer with nested HTTP GETs via {{HttpURLConnection}}. When an admin does a {{-transitionToActive}} during this transfer, part of that is interrupting an ongoing checkpoint so we can transition immediately. However, the {{thread.interrupt()}} in {{StandbyCheckpointer#stop}} gets swallowed by {{connection.getResponseCode()}} in {{TransferFsImage#doGetUrl}}. None of the methods in HttpURLConnection throw InterruptedException, so we need to do something else (perhaps HttpClient [1]): [1]: http://hc.apache.org/httpclient-3.x/ -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira