[ 
https://issues.apache.org/jira/browse/YARN-4953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313775#comment-15313775
 ] 

Rohith Sharma K S commented on YARN-4953:
-----------------------------------------

bq. The main issue with aggregating as containers complete is the additional 
load on the namenode
Right, this is a major issue in large clusters.

Since rolling log aggregation is supported, I think it is worth deleting the 
log folders of completed containers once their logs have been aggregated, 
when rolling log aggregation is enabled. Are there any potential issues with 
this? Thoughts?
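
To make the proposal concrete, here is a rough sketch of the cleanup condition. It is not NodeManager code: the class `CompletedContainerLogCleaner`, the method `cleanUp`, and its parameters are hypothetical names chosen for illustration, and it assumes one log folder per container ID under the application's local log directory.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper for illustration only; not part of the YARN code base.
final class CompletedContainerLogCleaner {

    /**
     * Deletes the local log folder of every container that is both finished
     * and already aggregated. Does nothing when rolling aggregation is off.
     * Returns the IDs whose folders were deleted, in sorted order.
     */
    static List<String> cleanUp(Path appLogDir,
                                Set<String> finishedContainers,
                                Set<String> aggregatedContainers,
                                boolean rollingAggregationEnabled) throws IOException {
        List<String> deleted = new ArrayList<>();
        if (!rollingAggregationEnabled) {
            return deleted; // keep the current behaviour: clean up at app finish
        }
        for (String containerId : new TreeSet<>(finishedContainers)) {
            if (!aggregatedContainers.contains(containerId)) {
                continue; // logs not uploaded yet, so the folder must stay
            }
            Path dir = appLogDir.resolve(containerId);
            if (Files.isDirectory(dir)) {
                deleteRecursively(dir);
                deleted.add(containerId);
            }
        }
        return deleted;
    }

    private static void deleteRecursively(Path root) throws IOException {
        try (Stream<Path> walk = Files.walk(root)) {
            // Reverse order so files and subfolders go before their parent.
            List<Path> paths = walk.sorted(Comparator.reverseOrder())
                                   .collect(Collectors.toList());
            for (Path p : paths) {
                Files.delete(p);
            }
        }
    }
}
```

Running containers and finished containers whose logs have not been aggregated yet are left untouched, so only folders that are no longer needed locally stop counting against the subfolder limit.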

> Delete completed container log folder when rolling log aggregation is enabled
> -----------------------------------------------------------------------------
>
>                 Key: YARN-4953
>                 URL: https://issues.apache.org/jira/browse/YARN-4953
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>            Reporter: Rohith Sharma K S
>            Assignee: Rohith Sharma K S
>
> There is a potential bottleneck when a cluster runs a very large number of 
> containers on the same NodeManager for a single application. Linux limits 
> the number of subfolders in a directory to 32K. If an application has more 
> than 32K containers on a node, container launch fails, and from that point 
> on no more containers can be launched on that node.
> Currently, log folders are deleted only after the application has finished. 
> Rolling log aggregation aggregates logs to HDFS periodically. 
> I think that once aggregation has completed for finished containers, cleanup 
> can be done, i.e. the log folders of the finished containers can be deleted. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
