[ 
https://issues.apache.org/jira/browse/YARN-6148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15865008#comment-15865008
 ] 

Rohith Sharma K S commented on YARN-6148:
-----------------------------------------

Sorry to pitch in late, went through the discussion above.  I think we should 
consider per partition node count strictly rather than assuming the application 
is running with one/two partition. 

In case of blacklisting scenario, application would hang if application is 
running with multiple partition. Lets say, application is running with <lableA, 
5> <labelB, 10><labelC, 10> If all nodes of labelB blacklisted by application, 
then purging will not happen at any cost. This leads to application hang 
forever.  YARN has to send per partition node count and also AM should handle 
per partition count blacklisting. Then the whole problem is solved.

> NM node count reported to AM in Allocate Response should consider requested 
> node label partitions.
> --------------------------------------------------------------------------------------------------
>
>                 Key: YARN-6148
>                 URL: https://issues.apache.org/jira/browse/YARN-6148
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Varun Saxena
>            Assignee: Varun Saxena
>         Attachments: YARN-6148.01.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to