Peter Bacsko created YUNIKORN-1187:
--------------------------------------
Summary: [Umbrella] Recovery stabilization
Key: YUNIKORN-1187
URL: https://issues.apache.org/jira/browse/YUNIKORN-1187
Project: Apache YuniKorn
Issue Type: Improvement
Components: shim - kubernetes
Reporter: Peter Bacsko
In the past weeks, we discovered numerous problem that can occur during the
recovery phase. We need to make that part more reliable, because jobs can get
stuck, internal states are not restored properly, etc.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]