[ 
https://issues.apache.org/jira/browse/YARN-4165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14904028#comment-14904028
 ] 

Weiwei Yang commented on YARN-4165:
-----------------------------------

Hello Jason,

Thanks for looking at this. I checked YARN-957, but I think this is a 
different problem. 

I have 3 nodes:

NM1 8G
NM2 8G
NM3 8G

I submitted an application that requires 4 containers, each with relatively 
large memory (5G), plus an app master that requires 1G. The RM allocated 3 
containers and the app master, leaving 1 outstanding request. *Unexpectedly*, 
the RM then reserved 1 container on all 3 nodes, like this:

NM1 - 1 container, 1 app master - 6G used - 2G left - 5G reserved 
NM2 - 1 container - 5G used - 3G left - 5G reserved
NM3 - 1 container - 5G used - 3G left - 5G reserved
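
To make the arithmetic concrete, here is a rough sketch in Python that 
reproduces the numbers above. It is only a toy model of what I observed, not 
the scheduler's actual code: the names simulate and schedulable_nodes are made 
up for illustration, and the "reserve on every node that cannot fit the 
request" and "skip reserved nodes" rules are assumptions taken from the 
behavior described in this issue.

```python
# Toy model of the observed behavior; NOT the CapacityScheduler implementation.
NODE_MEMORY_GB = 8   # memory per NodeManager
CONTAINER_GB = 5     # memory per requested container
APP_MASTER_GB = 1    # memory for the app master


def simulate(num_nodes=3, num_containers=4):
    """Place the app master and containers, then reserve the leftover request."""
    nodes = [{"name": f"NM{i + 1}", "used": 0, "reserved": 0}
             for i in range(num_nodes)]
    nodes[0]["used"] += APP_MASTER_GB  # app master lands on the first node

    pending = num_containers
    for node in nodes:
        # At most one 5G container fits on an 8G node.
        if pending and NODE_MEMORY_GB - node["used"] >= CONTAINER_GB:
            node["used"] += CONTAINER_GB
            pending -= 1

    # Assumed rule (the unexpected part): the outstanding request gets a
    # reservation on every node that cannot fit it right now.
    if pending:
        for node in nodes:
            if NODE_MEMORY_GB - node["used"] < CONTAINER_GB:
                node["reserved"] = CONTAINER_GB
    return nodes, pending


def schedulable_nodes(nodes, request_gb):
    """Assumed rule: a node holding a reservation is skipped for other requests."""
    return [n["name"] for n in nodes
            if n["reserved"] == 0
            and NODE_MEMORY_GB - n["used"] >= request_gb]


if __name__ == "__main__":
    nodes, pending = simulate()
    for n in nodes:
        left = NODE_MEMORY_GB - n["used"]
        print(f'{n["name"]} - {n["used"]}G used - {left}G left - '
              f'{n["reserved"]}G reserved')
    print("outstanding requests:", pending)
    print("nodes available for a 1G request:", schedulable_nodes(nodes, 1))
```

In this model every node still has 2G or 3G free, yet a 1G request finds no 
schedulable node because all three carry a reservation.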

I am not sure yet why we run into this situation, but it might be related to 
YARN-1769. I am still investigating; if you have any pointers or comments, 
please let me know. Thanks.


> An outstanding container request makes all nodes to be reserved causing all 
> jobs pending
> ----------------------------------------------------------------------------------------
>
>                 Key: YARN-4165
>                 URL: https://issues.apache.org/jira/browse/YARN-4165
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: capacity scheduler, resourcemanager, scheduler
>    Affects Versions: 2.7.1
>            Reporter: Weiwei Yang
>            Assignee: Weiwei Yang
>
> We have a long-running service in YARN. It has an outstanding container 
> request that YARN cannot satisfy (it requires more memory than the 
> nodemanager can supply). YARN then reserves all nodes for this application. 
> When I submit other jobs (which require a relatively small amount of memory 
> that the nodemanager can supply), all of them stay pending because YARN 
> skips scheduling containers on the nodes that have been reserved.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
