Mayank Bansal created YARN-2459:
-----------------------------------

             Summary: RM crashes if App gets rejected for any reason and HA is 
enabled
                 Key: YARN-2459
                 URL: https://issues.apache.org/jira/browse/YARN-2459
             Project: Hadoop YARN
          Issue Type: Bug
          Components: resourcemanager
    Affects Versions: 2.4.1
            Reporter: Mayank Bansal
            Assignee: Mayank Bansal
             Fix For: 2.5.0


This happens when RM HA is enabled and the ZooKeeper store is used as the RM 
state store.
If an app is rejected for any reason and goes directly from NEW to FAILED, the 
final transition adds it to the RMApps and completed-apps in-memory structures, 
but it is never written to the state store.
When RMApps later reaches its default limit, the RM starts deleting apps from 
both memory and the store. It then tries to delete this app from the store, the 
delete fails, and the RM crashes.
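
To make the sequence above concrete, here is a minimal, self-contained sketch 
of the failure mode. It is not the actual ResourceManager code: the class and 
method names (StateStore, AppManager, completeAccepted, completeRejected) are 
hypothetical, the store is an in-memory set standing in for ZooKeeper, and the 
"guarded" variant is only one possible way to avoid the crash, not necessarily 
the fix applied for this issue.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashSet;
import java.util.Set;

class RejectedAppEvictionSketch {

    /** Hypothetical stand-in for a ZooKeeper-backed RM state store. */
    static class StateStore {
        private final Set<String> stored = new HashSet<>();

        void store(String appId) {
            stored.add(appId);
        }

        boolean contains(String appId) {
            return stored.contains(appId);
        }

        /** Throws when the entry is absent, modelling the failed delete. */
        void remove(String appId) {
            if (!stored.remove(appId)) {
                throw new IllegalStateException("No stored state for " + appId);
            }
        }
    }

    /** Hypothetical model of the completed-apps bookkeeping. */
    static class AppManager {
        private final Deque<String> completedApps = new ArrayDeque<>();
        private final StateStore store;
        private final int maxCompletedApps;
        private final boolean checkStoreBeforeRemove;

        AppManager(StateStore store, int maxCompletedApps,
                   boolean checkStoreBeforeRemove) {
            this.store = store;
            this.maxCompletedApps = maxCompletedApps;
            this.checkStoreBeforeRemove = checkStoreBeforeRemove;
        }

        /** Accepted app: state is saved, then the app completes. */
        void completeAccepted(String appId) {
            store.store(appId);
            completedApps.addLast(appId);
            evictIfNeeded();
        }

        /** Rejected app (NEW to FAILED): tracked in memory, never stored. */
        void completeRejected(String appId) {
            completedApps.addLast(appId);
            evictIfNeeded();
        }

        /** Drops the oldest completed apps from memory and from the store. */
        private void evictIfNeeded() {
            while (completedApps.size() > maxCompletedApps) {
                String oldest = completedApps.pollFirst();
                if (checkStoreBeforeRemove && !store.contains(oldest)) {
                    continue; // nothing was ever stored for this app
                }
                store.remove(oldest);
            }
        }
    }

    public static void main(String[] args) {
        // Unguarded: the rejected app is evicted and the store delete fails.
        AppManager unguarded = new AppManager(new StateStore(), 1, false);
        unguarded.completeRejected("app_1");
        try {
            unguarded.completeAccepted("app_2");
        } catch (IllegalStateException e) {
            System.out.println("Unguarded eviction failed: " + e.getMessage());
        }

        // Guarded: the missing store entry is skipped instead.
        AppManager guarded = new AppManager(new StateStore(), 1, true);
        guarded.completeRejected("app_1");
        guarded.completeAccepted("app_2");
        System.out.println("Guarded eviction completed");
    }
}
```

In this sketch the unguarded manager fails as soon as the rejected app becomes 
the oldest completed entry and is evicted, which mirrors the crash described 
above. An alternative under the same assumptions would be to write rejected 
apps to the store as well, so that memory and store stay consistent.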

Thanks,
Mayank



--
This message was sent by Atlassian JIRA
(v6.2#6252)