Craig Condit created YUNIKORN-2068:
--------------------------------------
Summary: Deadlock when evaluating nodes for preemption
Key: YUNIKORN-2068
URL: https://issues.apache.org/jira/browse/YUNIKORN-2068
Project: Apache YuniKorn
Issue Type: Bug
Components: shim - kubernetes
Reporter: Craig Condit
Assignee: Craig Condit
During preemption predicate checks, we have a read lock that can be acquired
multiple times without releasing, leading to deadlocks. Problematic ocde is in
context.IsPodFitNodeViaPreemption(), where ctx.schedulerCache.GetPod() is
called after ctx.schedulerCache.LockForReads(). Both functions acquire the
cache rlock.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]