[jira] [Commented] (YUNIKORN-2895) Don't add duplicated allocation to node when the allocation ask fails

Damon Cortesi (Jira) Tue, 08 Oct 2024 21:54:04 -0700


    [ 
https://issues.apache.org/jira/browse/YUNIKORN-2895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17887765#comment-17887765
 ]


Damon Cortesi commented on YUNIKORN-2895:
-----------------------------------------

I'm also experiencing this after upgrading from 1.5.2 to 1.6.0.

The first time it happened, the scheduler had error logs about orphan 
applications. The second time, the scheduler still reports as healthy, but 
applications don't get scheduled. 

{{Request '016df77b-121e-4e22-ab9b-7ae64485e2d2' does not fit in queue 
'root.<redacted>' (requested map[memory:2550136832 pods:1 vcore:1000], 
available map[ephemeral-storage:891547735818 memory:133673287680 pods:1614 
vcore:-137500])}}
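
The negative vcore value in that message is enough to reject the request: a request fits only if every requested resource type is covered by what is available. A minimal Go sketch of that check, assuming resources are plain maps of quantities. The `fits` helper is hypothetical and is not YuniKorn's API; treating a resource type missing from the available map as zero is also an assumption.

```go
package main

import "fmt"

// fits reports whether every requested quantity is covered by what is
// available. Hypothetical helper for illustration only; a resource type
// absent from available is treated as zero here (an assumption).
func fits(requested, available map[string]int64) bool {
	for name, want := range requested {
		if want > available[name] {
			return false
		}
	}
	return true
}

func main() {
	// Values copied from the log message above.
	requested := map[string]int64{"memory": 2550136832, "pods": 1, "vcore": 1000}
	available := map[string]int64{
		"ephemeral-storage": 891547735818,
		"memory":            133673287680,
		"pods":              1614,
		"vcore":             -137500,
	}
	fmt.Println(fits(requested, available)) // prints false: vcore is negative
}
```

Memory and pods fit comfortably, so vcore alone makes the check fail.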

This is on a very low volume test cluster, so let me know if I can help provide 
any info.

> Don't add duplicated allocation to node when the allocation ask fails
> ---------------------------------------------------------------------
>
>                 Key: YUNIKORN-2895
>                 URL: https://issues.apache.org/jira/browse/YUNIKORN-2895
>             Project: Apache YuniKorn
>          Issue Type: Bug
>          Components: core - scheduler
>            Reporter: Qi Zhu
>            Assignee: Qi Zhu
>            Priority: Critical
>
> While revisiting the new update allocation logic, I found that a duplicate 
> allocation can be added to a node if the allocation is already allocated: 
> we try to add the allocation to the node again and don't revert it.
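
The failure mode described above can be sketched as follows. This is a simplified illustration, not YuniKorn's actual code or the fix applied for this issue: the `Node` type, its fields, and the `AddAllocation` and `RemoveAllocation` methods are all hypothetical. The idea is that adding the same allocation twice must not deduct node capacity twice, and that an add can be reverted.

```go
package main

import (
	"errors"
	"fmt"
)

// Node is a hypothetical, simplified stand-in for a scheduler node.
type Node struct {
	availableVcore int64
	allocations    map[string]int64 // allocation key -> vcore held
}

var errDuplicate = errors.New("allocation already on node")

// AddAllocation deducts the allocation from the node once. A repeated add
// for the same key is rejected, so capacity is never deducted twice.
func (n *Node) AddAllocation(key string, vcore int64) error {
	if _, ok := n.allocations[key]; ok {
		return errDuplicate
	}
	n.allocations[key] = vcore
	n.availableVcore -= vcore
	return nil
}

// RemoveAllocation reverts an add and returns the held vcore to the node.
func (n *Node) RemoveAllocation(key string) {
	if vcore, ok := n.allocations[key]; ok {
		delete(n.allocations, key)
		n.availableVcore += vcore
	}
}

func main() {
	n := &Node{availableVcore: 2000, allocations: map[string]int64{}}

	err := n.AddAllocation("alloc-1", 1000)
	fmt.Println(err, n.availableVcore) // <nil> 1000

	err = n.AddAllocation("alloc-1", 1000) // duplicate add is rejected
	fmt.Println(err, n.availableVcore)     // allocation already on node 1000

	n.RemoveAllocation("alloc-1") // revert
	fmt.Println(n.availableVcore) // 2000
}
```

Without the duplicate check, the second add would leave the node at 0 available vcore while holding only one allocation, and repeating that would push the accounted capacity negative.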



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

