[
https://issues.apache.org/jira/browse/YARN-9437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
qiuliang updated YARN-9437:
---------------------------
Description: We use hadoop-2.9.1 in our production environment with 1600+
nodes. 95.63% of RM memory is occupied by RMNodeImpl objects. Analysis of RM
memory found that each RMNodeImpl occupies approximately 14 MB. The reason is
that each RMNodeImpl holds 130,000+ completed containers that have not been
released.
(was: We use hadoop-2.9.1 in our production environment with 1600+ nodes.
95.63% of RM memory is occupied by RMNodeImpl. Analysis of RM memory found that
each RMNodeImpl has approximately 14M. The reason is that there is a 13W+
completedcontainers in each RMNodeImpl that has not been released.)
> RMNodeImpls occupy too much memory and cause RM GC to take a long time
> ----------------------------------------------------------------------
>
> Key: YARN-9437
> URL: https://issues.apache.org/jira/browse/YARN-9437
> Project: Hadoop YARN
> Issue Type: Bug
> Components: resourcemanager
> Affects Versions: 2.9.1
> Reporter: qiuliang
> Priority: Blocker
> Attachments: 1.png, 2.png, 3.png
>
>
> We use hadoop-2.9.1 in our production environment with 1600+ nodes. 95.63% of
> RM memory is occupied by RMNodeImpl objects. Analysis of RM memory found that
> each RMNodeImpl occupies approximately 14 MB. The reason is that each
> RMNodeImpl holds 130,000+ completed containers that have not been released.
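
For scale, the reported figures can be combined into a rough estimate. The sketch below is only back-of-envelope arithmetic: it assumes "14M" means 14 MiB per RMNodeImpl, uses the lower bounds of 1600 nodes and 130,000 completed containers, and attributes all of each RMNodeImpl's memory to the retained entries, so the per-entry figure is an upper bound. The function and constant names are illustrative and do not come from Hadoop.

```python
# Back-of-envelope estimate built only from the numbers in the report.
NODES = 1600                       # "1600+ nodes" (lower bound)
BYTES_PER_RMNODE = 14 * 1024**2    # "approximately 14M", read as 14 MiB (assumption)
COMPLETED_PER_NODE = 130_000       # "130,000+" completed containers per RMNodeImpl


def estimate(nodes, bytes_per_node, entries_per_node):
    """Return (total bytes across all nodes, bytes per retained entry)."""
    total_bytes = nodes * bytes_per_node
    # Upper bound: treats the whole RMNodeImpl footprint as retained entries.
    bytes_per_entry = bytes_per_node / entries_per_node
    return total_bytes, bytes_per_entry


total_bytes, bytes_per_entry = estimate(NODES, BYTES_PER_RMNODE, COMPLETED_PER_NODE)
print(f"RMNodeImpl total: {total_bytes / 1024**3:.1f} GiB")
print(f"Per retained entry: {bytes_per_entry:.0f} bytes")
```

Under these assumptions the RMNodeImpl objects together account for roughly 22 GiB of heap, or about 113 bytes per retained completed-container entry.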