Shravan Achar created YUNIKORN-2526:
---------------------------------------
Summary: Discrepancy between shim cache and core app/task list
after scheduler restart
Key: YUNIKORN-2526
URL: https://issues.apache.org/jira/browse/YUNIKORN-2526
Project: Apache YuniKorn
Issue Type: Bug
Components: shim - kubernetes
Reporter: Shravan Achar
Attachments: log-snippet.txt, state-dump-4-1-3.json
When scheduler restarts, occasionally it gets into a situation where the
application is still in Running state despite the application getting
terminated in the cluster. This is confirmed with the attached state dump.
The scheduler core logs indicate all nodes are being evaluated for non-existing
application (also attached). The CPU is being used up doing this unneeded
evaluation.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]