Ayub Pathan created YUNIKORN-520:
------------------------------------
Summary: Gang scheduling waits indefinitely for placeholder pod
allocation even where is no quota left in the queue
Key: YUNIKORN-520
URL: https://issues.apache.org/jira/browse/YUNIKORN-520
Project: Apache YuniKorn
Issue Type: Bug
Components: core - scheduler
Affects Versions: 0.10
Reporter: Ayub Pathan
Attachments: job.yaml, ns.yaml
* Create a namespace([^ns.yaml] with quota.
* Submit a job([^job.yaml] where the placeholder pods resource requests are
more than queue quota.
The YK waits indefinitely for the placeholder pods minMember limit to be
satisfied. Gang scheduling can action to avoid this indefinite lock situation,
incase of resource limits.
{noformat}
NAME↑ READY STATUS RS CPU MEM %CPU/R %MEM/R %CPU/L
%MEM/L IP NODE QOS
AGE │
│ batch-sleep-job-4-hzc49 0/0 Pending 0 n/a n/a n/a n/a n/a
n/a n/a n/a BU
4h28m │
│ batch-sleep-job-4-jmqbf 0/0 Pending 0 n/a n/a n/a n/a n/a
n/a n/a n/a BU
4h28m │
│ batch-sleep-job-4-kg7hl 0/0 Pending 0 n/a n/a n/a n/a n/a
n/a n/a n/a BU
4h28m │
│ tg-tg1-batch-sleep-job-4-0 1/1 Running 0 n/a n/a n/a n/a n/a
n/a 100.100.134.233 ip-10-192-136-201.ca-central-1.compute.internal BU
4h28m │
│ tg-tg1-batch-sleep-job-4-1 1/1 Running 0 n/a n/a n/a n/a n/a
n/a 100.100.87.75 ip-10-192-159-151.ca-central-1.compute.internal BU
4h28m │
│ tg-tg1-batch-sleep-job-4-2 0/0 Pending 0 n/a n/a n/a n/a n/a
n/a n/a n/a BU
4h28m │ {noformat}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]