[ 
https://issues.apache.org/jira/browse/YARN-9830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16928305#comment-16928305
 ] 

Bibin A Chundatt edited comment on YARN-9830 at 9/12/19 7:22 AM:
-----------------------------------------------------------------

The AbstractLivelinessMonitor methods are synchronized, which blocks concurrent 
access even when the calls are for different containerIds.

PingThread actually monitors only the entries in *running*.

Could AbstractLivelinessMonitor#running be changed to a ConcurrentHashMap, and 
the synchronization at the class level removed?
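
A minimal sketch of that idea, assuming a simplified monitor: LivelinessMonitorSketch, checkExpired and the injected clock are hypothetical names for illustration, not the actual Hadoop code. It shows per-key atomic updates on a ConcurrentHashMap in place of synchronized methods, so callers working on different containerIds do not wait on one shared lock.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.LongSupplier;

// Hypothetical, simplified stand-in for AbstractLivelinessMonitor (not the Hadoop class).
class LivelinessMonitorSketch<O> {
    // Maps each monitored object to the time of its last ping.
    private final Map<O, Long> running = new ConcurrentHashMap<>();
    private final long expireIntervalMs;
    private final LongSupplier clock; // injected so the sketch is testable

    LivelinessMonitorSketch(long expireIntervalMs, LongSupplier clock) {
        this.expireIntervalMs = expireIntervalMs;
        this.clock = clock;
    }

    // No synchronized keyword: put() is thread-safe on its own.
    void register(O ob) {
        running.put(ob, clock.getAsLong());
    }

    void unregister(O ob) {
        running.remove(ob);
    }

    // computeIfPresent is atomic per key, so a late ping cannot
    // re-add an entry that was already unregistered or expired.
    void receivedPing(O ob) {
        running.computeIfPresent(ob, (key, last) -> clock.getAsLong());
    }

    // What the checker thread would run periodically. Iteration over a
    // ConcurrentHashMap is weakly consistent and never throws
    // ConcurrentModificationException.
    List<O> checkExpired() {
        long now = clock.getAsLong();
        List<O> expired = new ArrayList<>();
        for (Map.Entry<O, Long> e : running.entrySet()) {
            Long last = e.getValue();
            // remove(key, value) succeeds only if no ping arrived in between.
            if (now > last + expireIntervalMs && running.remove(e.getKey(), last)) {
                expired.add(e.getKey());
            }
        }
        return expired;
    }
}
```

The conditional remove(key, value) is what lets the expiry check run without a monitor lock: if a ping updates the timestamp between the read and the remove, the entry is left in place.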

[~rohithsharma]/[~sunil.gov...@gmail.com]



was (Author: bibinchundatt):
AbstractLivelinessMonitor and methods are synchronized which block concurrent 
access based on multiple containerIds

PingThread actually monitors the *running* containers items .

In AbstractLivelinessMonitor#running could be changed to concurrentHashMap and 
remove the synchronization at the class level.??


> Improve ContainerAllocationExpirer it blocks scheduling
> -------------------------------------------------------
>
>                 Key: YARN-9830
>                 URL: https://issues.apache.org/jira/browse/YARN-9830
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Bibin A Chundatt
>            Priority: Critical
>              Labels: perfomance
>
> {quote}
>    java.lang.Thread.State: BLOCKED (on object monitor)
>         at org.apache.hadoop.yarn.util.AbstractLivelinessMonitor.register(AbstractLivelinessMonitor.java:106)
>         - waiting to lock <0x00007fa348749550> (a org.apache.hadoop.yarn.server.resourcemanager.rmcontainer.ContainerAllocationExpirer)
>         at org.apache.hadoop.yarn.server.resourcemanager.rmcontainer.RMContainerImpl$AcquiredTransition.transition(RMContainerImpl.java:601)
>         at org.apache.hadoop.yarn.server.resourcemanager.rmcontainer.RMContainerImpl$AcquiredTransition.transition(RMContainerImpl.java:592)
>         at org.apache.hadoop.yarn.state.StateMachineFactory$SingleInternalArc.doTransition(StateMachineFactory.java:362)
>         at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:302)
>         at org.apache.hadoop.yarn.state.StateMachineFactory.access$500(StateMachineFactory.java:46)
>         at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:487)
>         - locked <0x00007fc8852f8200> (a org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine)
>         at org.apache.hadoop.yarn.server.resourcemanager.rmcontainer.RMContainerImpl.handle(RMContainerImpl.java:474)
>         at org.apache.hadoop.yarn.server.resourcemanager.rmcontainer.RMContainerImpl.handle(RMContainerImpl.java:65)
> {quote}


