Weiwei Yang created YUNIKORN-201:
------------------------------------

             Summary: Application tracking API and CRD
                 Key: YUNIKORN-201
                 URL: https://issues.apache.org/jira/browse/YUNIKORN-201
             Project: Apache YuniKorn
          Issue Type: New Feature
          Components: core - scheduler, scheduler-interface, shim - kubernetes
            Reporter: Weiwei Yang
            Assignee: Kinga Marton


Today, YK works behind the scene, and the workflow is like
 # app operator or job server launch a bunch of pods on K8s
 # YK gets notified and group pods to apps based on appID
 # YK schedules the pods with respect to the app info

This provides a simple model to integrate with existing K8s and to support 
workloads, but it has some user experience issues. Such as
 # YK can hardly manage the app lifecycle end to end. An outstanding issue is 
we do not know when an app is finished if we only look at the pod status. 
 # YK doesn't have ability to admit apps. We need the ability to admit app 
based on various conditions, e.g resource quota, cluster overhead, ACL, etc. 
 # Hard to track app status. Sometimes app might be pending in resource queues, 
but we do not have a good way to expose such status info.

To further improve the user experience, we need to introduce an application 
tracking API and K8s custom resource definition (CRD). The CRD will be used by 
app operator/job server to interact with YK, to get the lifecycle fully 
controlled.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to