Bibin A Chundatt created YARN-9640:
--------------------------------------

             Summary: Slow event processing could cause too many attempt 
unregister events
                 Key: YARN-9640
                 URL: https://issues.apache.org/jira/browse/YARN-9640
             Project: Hadoop YARN
          Issue Type: Bug
            Reporter: Bibin A Chundatt


We found in one of our test cluster verification that the number attempt 
unregister events is about 300k+.
 # AM all containers completed.
 # AMRMClientImpl send finishApplcationMaster
 # AMRMClient check event 100ms the finish Status using finishApplicationMaster 
request.
 # AMRMClientImpl#unregisterApplicationMaster
{code:java}
      while (true) {
        FinishApplicationMasterResponse response =
            rmClient.finishApplicationMaster(request);
        if (response.getIsUnregistered()) {
          break;
        }
        LOG.info("Waiting for application to be successfully unregistered.");
        Thread.sleep(100);
      }
{code}

 # ApplicationMasterService finishApplicationMaster interface sends unregister 
events on every status update.

We should send unregister event only once and cache event send , ignore and 
send not unregistered response back to AM not overloading the event queue.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org

Reply via email to