Jian He created YARN-2721: ----------------------------- Summary: Race condition: ZKRMStateStore retry logic may throw NodeExist exception Key: YARN-2721 URL: https://issues.apache.org/jira/browse/YARN-2721 Project: Hadoop YARN Issue Type: Bug Reporter: Jian He Assignee: Jian He Fix For: 2.6.0
Blindly retrying operations in zookeeper will not work for non-idempotent operations (like create znode). The reason is that the client can do a create znode, but the response may not be returned because the server can die or timeout. In case of retrying the create znode, it will throw a NODE_EXISTS exception from the earlier create from the same session. -- This message was sent by Atlassian JIRA (v6.3.4#6332)