[ 
https://issues.apache.org/jira/browse/YARN-9948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16970038#comment-16970038
 ] 

Hu Ziqian commented on YARN-9948:
---------------------------------

Hi [~hex108], in our cluster, one attempt will use more than 1k memory in 
RMAppImpl. And it will also increase the memory size and response size when 
user call the restful api. In our cluster, most of the apps are streaming apps 
and some of them may try more than 10 thousand attempts. It's really a big 
memory cost.

 

in comment 2, it says that

_app will always retry if there are some attempts that does not count towards 
max attempt retry in the attempts we kept._

I think it won't happen because we only delete attempt which finish time < 
(endTime  - attemptFailuresValidityInterval). Although we delete some old 
attempts, it will not change the result of  (numberOfFailure < 
app.maxAppAttempts) and won't change the result of app state. If app try more 
than app.maxAppAttempts during attemptFailuresValidityInterval, it will failed 
as expected.

 

I also add a config to keep them in memory by default. 

> Remove attempts that are beyond max-attempt limit from RMAppImpl
> ----------------------------------------------------------------
>
>                 Key: YARN-9948
>                 URL: https://issues.apache.org/jira/browse/YARN-9948
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: resourcemanager
>    Affects Versions: 3.1.3
>            Reporter: Hu Ziqian
>            Priority: Major
>         Attachments: YARN-9948.001.patch
>
>
> RM will store app attempt in both state store and RMAppImpl. YARN-3480 
> removes attempts that are beyond max-attempt limit from state store.  In this 
> issue we delete those attempts in RMAppImpl the reduce decrease memory usage 
> of RM.
> We introduce flag yarn.resourcemanager.am.delete-old-attempts.enabled to 
> enable this logic, default value is false.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to