[ https://issues.apache.org/jira/browse/YARN-8291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Prabhu Joseph updated YARN-8291: -------------------------------- Affects Version/s: (was: 2.7.3) 3.0.0 > RMRegistryOperationService don't have limit on AsyncPurge threads > ----------------------------------------------------------------- > > Key: YARN-8291 > URL: https://issues.apache.org/jira/browse/YARN-8291 > Project: Hadoop YARN > Issue Type: Bug > Components: yarn > Affects Versions: 3.0.0 > Reporter: Prabhu Joseph > Priority: Major > > When there are more than 10000+ containers finished - > RMRegistryOperationService will create 10000+ threads for performing > AsyncPurge which can slowdown the ResourceManager process. There should be a > limit on the number of threads. > {code} > "RegistryAdminService 554485" #824351 prio=5 os_prio=0 tid=0x00007fe4b2bc9800 > nid=0xf8ed in Object.wait() [0x00007fe31a5e4000] > java.lang.Thread.State: WAITING (on object monitor) > at java.lang.Object.wait(Native Method) > at java.lang.Object.wait(Object.java:502) > at org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1386) > - locked <0x00000007902ec7d8> (a > org.apache.zookeeper.ClientCnxn$Packet) > at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1040) > at > org.apache.curator.framework.imps.ExistsBuilderImpl$2.call(ExistsBuilderImpl.java:172) > at > org.apache.curator.framework.imps.ExistsBuilderImpl$2.call(ExistsBuilderImpl.java:161) > at org.apache.curator.RetryLoop.callWithRetry(RetryLoop.java:107) > at > org.apache.curator.framework.imps.ExistsBuilderImpl.pathInForeground(ExistsBuilderImpl.java:158) > at > org.apache.curator.framework.imps.ExistsBuilderImpl.forPath(ExistsBuilderImpl.java:148) > at > org.apache.curator.framework.imps.ExistsBuilderImpl.forPath(ExistsBuilderImpl.java:36) > at > org.apache.hadoop.registry.client.impl.zk.CuratorService.zkStat(CuratorService.java:455) > at > org.apache.hadoop.registry.client.impl.zk.RegistryOperationsService.stat(RegistryOperationsService.java:137) > at > org.apache.hadoop.registry.client.binding.RegistryUtils.statChildren(RegistryUtils.java:210) > at > org.apache.hadoop.registry.server.services.RegistryAdminService.purge(RegistryAdminService.java:450) > at > org.apache.hadoop.registry.server.services.RegistryAdminService.purge(RegistryAdminService.java:520) > at > org.apache.hadoop.registry.server.services.RegistryAdminService$AsyncPurge.call(RegistryAdminService.java:570) > at > org.apache.hadoop.registry.server.services.RegistryAdminService$AsyncPurge.call(RegistryAdminService.java:543) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org