Botong Huang created YARN-6955:
----------------------------------
Summary: Concurrent registerAM thread in Federation Interceptor
Key: YARN-6955
URL: https://issues.apache.org/jira/browse/YARN-6955
Project: Hadoop YARN
Issue Type: Bug
Reporter: Botong Huang
Assignee: Botong Huang
Priority: Minor
The timeout between the AM and AMRMProxy is shorter than the timeout plus
failover time between FederationInterceptor (FI, in AMRMProxy) and the RM.
When the first register thread in FI is blocked because of an RM failover,
the AM can time out and resend the register call, leaving two outstanding
register calls inside FI.
When the RM eventually comes back up, one thread registers successfully and
the other gets an "application already registered" exception. FI should
swallow the exception and return success to the AM in both threads.
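
A minimal sketch of the proposed behavior, not the actual FI code. All class
and method names below (AlreadyRegisteredException, FakeResourceManager,
RegisterInterceptorSketch) are hypothetical stand-ins, and the String
"response" is an assumption made to keep the example self-contained:

```java
// Hypothetical stand-in for the "application already registered" error.
class AlreadyRegisteredException extends Exception {
    AlreadyRegisteredException(String message) {
        super(message);
    }
}

// Hypothetical stand-in for the RM: accepts the first register call and
// rejects every later one for the same application.
class FakeResourceManager {
    private boolean registered = false;

    synchronized String register(String appId) throws AlreadyRegisteredException {
        if (registered) {
            throw new AlreadyRegisteredException(
                "Application " + appId + " is already registered");
        }
        registered = true;
        return "registered:" + appId;
    }
}

// Hypothetical sketch of the interceptor-side handling.
class RegisterInterceptorSketch {
    private final FakeResourceManager rm;
    private final String appId;

    RegisterInterceptorSketch(FakeResourceManager rm, String appId) {
        this.rm = rm;
        this.appId = appId;
    }

    String registerApplicationMaster() {
        try {
            return rm.register(appId);
        } catch (AlreadyRegisteredException e) {
            // A concurrent register call from the same AM already succeeded,
            // so swallow the exception and report success to this caller too.
            return "registered:" + appId;
        }
    }
}

class RegisterSketchDemo {
    public static void main(String[] args) throws InterruptedException {
        FakeResourceManager rm = new FakeResourceManager();
        RegisterInterceptorSketch fi = new RegisterInterceptorSketch(rm, "app_1");
        String[] results = new String[2];

        // Two outstanding register calls, as after an AM-side timeout and retry.
        Thread first = new Thread(() -> results[0] = fi.registerApplicationMaster());
        Thread retry = new Thread(() -> results[1] = fi.registerApplicationMaster());
        first.start();
        retry.start();
        first.join();
        retry.join();

        // Prints "registered:app_1 / registered:app_1" whichever thread wins.
        System.out.println(results[0] + " / " + results[1]);
    }
}
```

In this sketch the losing thread simply rebuilds the success value. A real
register response carries more than a success flag, so how the losing thread
obtains a full response is left open here.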