Jing Zhao created HDFS-6229:
-------------------------------
Summary: Race condition in failover can cause RetryCache fail to
work
Key: HDFS-6229
URL: https://issues.apache.org/jira/browse/HDFS-6229
Project: Hadoop HDFS
Issue Type: Bug
Components: ha
Affects Versions: 2.1.0-beta
Reporter: Jing Zhao
Assignee: Jing Zhao
Currently when NN failover happens, the old SBN first sets its state to active,
then starts the active services (including tailing all the remaining editlog
and building a complete retry cache based on the editlog). If a retry request,
which has already succeeded in the old ANN (but the client fails to receive the
response), comes in between, this retry may still get served by the new ANN but
miss the retry cache.
--
This message was sent by Atlassian JIRA
(v6.2#6252)