[jira] [Resolved] (YUNIKORN-3003) Previously preempted allocations can be preempted again

Craig Condit (Jira) Thu, 19 Dec 2024 13:02:50 -0800


     [ 
https://issues.apache.org/jira/browse/YUNIKORN-3003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Craig Condit resolved YUNIKORN-3003.
------------------------------------
    Fix Version/s: 1.7.0
                   1.6.1
       Resolution: Fixed

Merged to master and cherry-picked to branch-1.6.

> Previously preempted allocations can be preempted again
> -------------------------------------------------------
>
>                 Key: YUNIKORN-3003
>                 URL: https://issues.apache.org/jira/browse/YUNIKORN-3003
>             Project: Apache YuniKorn
>          Issue Type: Bug
>          Components: core - scheduler
>            Reporter: Paul Santa Clara
>            Assignee: Paul Santa Clara
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.7.0, 1.6.1
>
>
> This is particularly apparent when preemption is used with longer values of 
> terminationGracePeriodSeconds.  A task can be selected for preemption and, 
> during its graceful shutdown period, that same task can be preempted again.
> When this occurs, the preemptingResource for the impacted queue is 
> incremented again for the SAME task, preventing it from ever returning to 
> zero, even after all tasks have fully completed their termination and 
> notified the core scheduler.  After stopping all workloads, the 
> preemptingResource will remain positive unless the YuniKorn scheduler pod is 
> restarted:
> ```
> "preemptingResource": { "ephemeral-storage": 21474836480, 
> "memory": 4194304000, "pods": 1, "vcore": 1000 },
> ``` 
> This preemptingResource leak can, in turn, cause the scheduler to skip 
> future preemptions, because it uses preemptingResource when computing the 
> actual used resources for a given queue: 
> [https://github.com/apache/yunikorn-core/blob/v1.6.0/pkg/scheduler/objects/preemption.go#L826]
>  
>  
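
The accounting problem described above can be illustrated with a minimal, self-contained Go sketch. It is not the YuniKorn implementation and is not necessarily how the merged fix works: the types `Resource`, `Allocation`, and `Queue`, and the functions `markPreemptedNaive`, `markPreemptedGuarded`, `release`, and `simulate` are all hypothetical names chosen for illustration. The sketch assumes the counter is decremented exactly once, when the task finishes terminating, and compares incrementing on every selection with skipping allocations that are already marked as preempted.

```go
package main

import "fmt"

// Resource is a simplified stand-in for a scheduler resource map
// (resource name -> quantity). Hypothetical type, not the YuniKorn API.
type Resource map[string]int64

func (r Resource) add(o Resource) {
	for k, v := range o {
		r[k] += v
	}
}

func (r Resource) sub(o Resource) {
	for k, v := range o {
		r[k] -= v
	}
}

func (r Resource) isZero() bool {
	for _, v := range r {
		if v != 0 {
			return false
		}
	}
	return true
}

// Allocation models a task that may be selected as a preemption victim.
type Allocation struct {
	Key       string
	Res       Resource
	Preempted bool
}

// Queue tracks the resources currently being preempted.
type Queue struct {
	Preempting Resource
}

// markPreemptedNaive increments the counter on every selection, even if
// the allocation is still inside its graceful shutdown period.
func (q *Queue) markPreemptedNaive(a *Allocation) {
	a.Preempted = true
	q.Preempting.add(a.Res)
}

// markPreemptedGuarded skips allocations that were already preempted,
// so the counter is incremented at most once per allocation.
func (q *Queue) markPreemptedGuarded(a *Allocation) bool {
	if a.Preempted {
		return false
	}
	a.Preempted = true
	q.Preempting.add(a.Res)
	return true
}

// release is called once, when the task has fully terminated.
func (q *Queue) release(a *Allocation) {
	if a.Preempted {
		q.Preempting.sub(a.Res)
	}
}

// simulate selects the same task twice during its shutdown period, then
// releases it once, and returns what is left in the queue's counter.
func simulate(guarded bool) Resource {
	q := &Queue{Preempting: Resource{}}
	task := &Allocation{Key: "task-1", Res: Resource{
		"ephemeral-storage": 21474836480,
		"memory":            4194304000,
		"pods":              1,
		"vcore":             1000,
	}}
	for i := 0; i < 2; i++ {
		if guarded {
			q.markPreemptedGuarded(task)
		} else {
			q.markPreemptedNaive(task)
		}
	}
	q.release(task)
	return q.Preempting
}

func main() {
	fmt.Println("naive leftover:  ", simulate(false))
	fmt.Println("guarded leftover:", simulate(true))
}
```

In the naive variant one full copy of the task's resources stays in the counter after the task is gone, which matches the shape of the leaked value quoted in the description. In the guarded variant the counter returns to zero.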



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@yunikorn.apache.org
For additional commands, e-mail: dev-h...@yunikorn.apache.org
