Wangda Tan created YARN-3410:

             Summary: YARN admin should be able to remove individual 
application record from RMStateStore
                 Key: YARN-3410
             Project: Hadoop YARN
          Issue Type: Bug
          Components: resourcemanager, yarn
            Reporter: Wangda Tan
            Priority: Critical

When RM state store entered an unexpected state, one example is YARN-2340, when 
an attempt is not in final state but app already completed, RM can never get up 
unless format RMStateStore.

I think we should support remove individual application records from 
RMStateStore to unblock RM admin make choice of either waiting for a fix or 
format state store.

In addition, RM should be able to report all fatal errors (which will shutdown 
RM) when doing app recovery, this can save admin some time to remove apps in 
bad state.

This message was sent by Atlassian JIRA

Reply via email to