Dmitry created YUNIKORN-2784:
--------------------------------

             Summary: Scheduler stuck
                 Key: YUNIKORN-2784
                 URL: https://issues.apache.org/jira/browse/YUNIKORN-2784
             Project: Apache YuniKorn
          Issue Type: Bug
            Reporter: Dmitry
         Attachments: Screenshot 2024-08-02 at 1.16.30 PM.png, Screenshot 
2024-08-02 at 1.20.23 PM.png, dumps.tgz, logs

Shortly after switching to YuniKorn, a number of very small pods get stuck in 
Pending (screenshot 1). Other pods are stuck as well, but these are the most 
visible, and they should definitely be running.

After restarting the scheduler, all of them are scheduled immediately 
(screenshot 2).

Attaching the output of `/ws/v1/stack`, `/ws/v1/fullstatedump` and 
`/debug/pprof/goroutine?debug=2`

The scheduler logs are attached as well.
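
For anyone who wants to capture the same diagnostics while the scheduler is 
stuck, here is a rough sketch of how the three endpoints above could be 
fetched. The base URL (`http://localhost:9080`, for example through a port 
forward), the output file names, and the helper names `dump_urls` and 
`collect_dumps` are assumptions for illustration, not part of this report; 
only the endpoint paths come from the text above.

```python
import urllib.request
from pathlib import Path

# Assumption: the scheduler's REST port is reachable locally, for example
# through a port forward; adjust BASE_URL to match your deployment.
BASE_URL = "http://localhost:9080"

# The paths are the three endpoints named in this report.
# The output file names are arbitrary.
ENDPOINTS = {
    "stack.txt": "/ws/v1/stack",
    "fullstatedump.json": "/ws/v1/fullstatedump",
    "goroutines.txt": "/debug/pprof/goroutine?debug=2",
}


def dump_urls(base_url=BASE_URL):
    """Map each output file name to the full URL it is fetched from."""
    base = base_url.rstrip("/")
    return {name: base + path for name, path in ENDPOINTS.items()}


def collect_dumps(out_dir="dumps", base_url=BASE_URL, timeout=30):
    """Fetch every endpoint and write the raw response bytes into out_dir."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for name, url in dump_urls(base_url).items():
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            (out / name).write_bytes(resp.read())
        written.append(out / name)
    return written
```

Calling `collect_dumps()` before restarting the scheduler matters, since the 
restart clears the stuck state that the dumps are meant to show.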




--
This message was sent by Atlassian Jira
(v8.20.10#820010)
