Shravan Achar created YUNIKORN-2526:
---------------------------------------

             Summary: Discrepancy between shim cache and core app/task list 
after scheduler restart
                 Key: YUNIKORN-2526
                 URL: https://issues.apache.org/jira/browse/YUNIKORN-2526
             Project: Apache YuniKorn
          Issue Type: Bug
          Components: shim - kubernetes
            Reporter: Shravan Achar
         Attachments: log-snippet.txt, state-dump-4-1-3.json

When scheduler restarts, occasionally it gets into a situation where the 
application is still in Running state despite the application getting 
terminated in the cluster. This is confirmed with the attached state dump.

 

The scheduler core logs indicate all nodes are being evaluated for non-existing 
application (also attached). The CPU is being used up doing this unneeded 
evaluation.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to