[ 
https://issues.apache.org/jira/browse/YARN-8138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zian Chen updated YARN-8138:
----------------------------
    Description: 
Add unit test to validate queue priority preemption works under node partition.

Test configuration:
queue A (capacity=50, priority=1)
queue B (capacity=50, priority=2)
both have accessible-node-labels set to x
A.accessible-node-labels.x.capacity = 50
B.accessible-node-labels.x.capacity = 50
Along with this pre-emption related properties have been set.

Test steps:
 - Submit an application A1 to B, with am-container = container = 4096, no. of 
containers = 4
 - Submit an application A2 to A, with am-container = 1024, container = 2048, 
no of containers = (NUM_NM-1)
 - Kill application A1
 - Submit an application A3 to B with am-container=container=5210, no. of 
containers=NUM_NM
 - Expectation is that containers are pre-empted from application A2 to A3 but 
there is no container pre-emption happening

  was:
There seems to be an issue with pre-emption when using node labels with queue 
priority.

Test configuration:
queue A (capacity=50, priority=1)
queue B (capacity=50, priority=2)
both have accessible-node-labels set to x
A.accessible-node-labels.x.capacity = 50
B.accessible-node-labels.x.capacity = 50
Along with this pre-emption related properties have been set.

Test steps:
 - Set NM memory = 6000MB and containerMemory = 750MB
 - Submit an application A1 to B, with am-container = container = 
(6000-750-1500), no. of containers = 2
 - Submit an application A2 to A, with am-container = 750, container = 1500, no 
of containers = (NUM_NM-1)
 - Kill application A1
 - Submit an application A3 to B with am-container=container=5000, no. of 
containers=3
 - Expectation is that containers are pre-empted from application A2 to A3 but 
there is no container pre-emption happening
Container pre-emption is stuck with the message in the RM log,
{noformat}
2018-02-02 11:41:36,974 INFO capacity.CapacityScheduler 
(CapacityScheduler.java:tryCommit(2673)) - Allocation proposal accepted
2018-02-02 11:41:36,984 INFO capacity.CapacityScheduler 
(CapacityScheduler.java:allocateContainerOnSingleNode(1391)) - Trying to 
fulfill reservation for application application_1517571510094_0003 on node: 
XXXXXXXXXX:25454
2018-02-02 11:41:36,984 INFO allocator.AbstractContainerAllocator 
(AbstractContainerAllocator.java:getCSAssignmentFromAllocateResult(97)) - 
Reserved container application=application_1517571510094_0003 
resource=<memory:3072, vCores:1> 
queue=org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.allocator.RegularContainerAllocator@3f04848e
 cluster=<memory:18000, vCores:3>
2018-02-02 11:41:36,984 INFO capacity.CapacityScheduler 
(CapacityScheduler.java:tryCommit(2673)) - Allocation proposal accepted
2018-02-02 11:41:36,984 INFO capacity.CapacityScheduler 
(CapacityScheduler.java:allocateContainerOnSingleNode(1391)) - Trying to 
fulfill reservation for application application_1517571510094_0003 on node: 
XXXXXXXXXX:25454
2018-02-02 11:41:36,984 INFO allocator.AbstractContainerAllocator 
(AbstractContainerAllocator.java:getCSAssignmentFromAllocateResult(97)) - 
Reserved container application=application_1517571510094_0003 
resource=<memory:3072, vCores:1> 
queue=org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.allocator.RegularContainerAllocator@3f04848e
 cluster=<memory:18000, vCores:3>
2018-02-02 11:41:36,984 INFO capacity.CapacityScheduler 
(CapacityScheduler.java:tryCommit(2673)) - Allocation proposal accepted
2018-02-02 11:41:36,994 INFO capacity.CapacityScheduler 
(CapacityScheduler.java:allocateContainerOnSingleNode(1391)) - Trying to 
fulfill reservation for application application_1517571510094_0003 on node: 
XXXXXXXXXX:25454
2018-02-02 11:41:36,995 INFO allocator.AbstractContainerAllocator 
(AbstractContainerAllocator.java:getCSAssignmentFromAllocateResult(97)) - 
Reserved container application=application_1517571510094_0003 
resource=<memory:3072, vCores:1> 
queue=org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.allocator.RegularContainerAllocator@3f04848e
 cluster=<memory:18000, vCores:3>{noformat}


> Add unit test to validate queue priority preemption works under node 
> partition.
> -------------------------------------------------------------------------------
>
>                 Key: YARN-8138
>                 URL: https://issues.apache.org/jira/browse/YARN-8138
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Charan Hebri
>            Assignee: Zian Chen
>            Priority: Minor
>         Attachments: YARN-8138.001.patch, YARN-8138.002.patch
>
>
> Add unit test to validate queue priority preemption works under node 
> partition.
> Test configuration:
> queue A (capacity=50, priority=1)
> queue B (capacity=50, priority=2)
> both have accessible-node-labels set to x
> A.accessible-node-labels.x.capacity = 50
> B.accessible-node-labels.x.capacity = 50
> Along with this pre-emption related properties have been set.
> Test steps:
>  - Submit an application A1 to B, with am-container = container = 4096, no. 
> of containers = 4
>  - Submit an application A2 to A, with am-container = 1024, container = 2048, 
> no of containers = (NUM_NM-1)
>  - Kill application A1
>  - Submit an application A3 to B with am-container=container=5210, no. of 
> containers=NUM_NM
>  - Expectation is that containers are pre-empted from application A2 to A3 
> but there is no container pre-emption happening



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to