[ https://issues.apache.org/jira/browse/YARN-11277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17729146#comment-17729146 ]
ASF GitHub Bot commented on YARN-11277: --------------------------------------- slfan1989 commented on PR #4797: URL: https://github.com/apache/hadoop/pull/4797#issuecomment-1575969505 @leixm Thank you for your contribution! I will merge this PR into the trunk branch. @ashutoshcipher @aajisaka If you have any new suggestions, feel free to bring them up anytime. Thanks again for helping to review the code! > trigger deletion of log-dir by size for NonAggregatingLogHandler > ---------------------------------------------------------------- > > Key: YARN-11277 > URL: https://issues.apache.org/jira/browse/YARN-11277 > Project: Hadoop YARN > Issue Type: Improvement > Components: nodemanager > Affects Versions: 3.4.0 > Reporter: Xianming Lei > Priority: Minor > Labels: pull-request-available > > In our yarn cluster, the log files of some containers are too large, which > causes the NodeManager to frequently switch to the unhealthy state. For logs > that are too large, we can consider deleting them directly without delaying > yarn.nodemanager.log.retain-seconds. > Cluster environment: > # 8k nodes+ > # 50w+ apps / day > Configuration: > # yarn.nodemanager.log.retain-seconds=3days > # yarn.log-aggregation-enable=false > > -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org