[ 
https://issues.apache.org/jira/browse/YUNIKORN-2535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rainie Li updated YUNIKORN-2535:
--------------------------------
    Description: 
We want to enhance YuniKorn's capability to guarantee Service Level Objectives 
(SLOs) for critical applications, particularly in scenarios where the cluster 
has limited resources. 
YuniKorn will be responsible for monitoring the real-time status of 
applications and dynamically adjust the queues (including resources, priority, 
etc.) aligned with the applications' SLOs.

Here is initial proposal 
[https://docs.google.com/document/d/1c8tzmEgl32o6_0eDxQ1ZiMRcD-1QRvdu-3PABNZQX30/edit#heading=h.nkwxxppm7zbd]
 feature 2.

We had some discussions with the community during meetup on 04/03/24 and 
received valuable insights and suggestions.

Moving forward, we will create a more detailed design doc. Once ready, it will 
be open for review and further discussions with the community.

cc [~wilfreds] [~ccondit]

  was:
We want to guarantee SLO for critical apps when a cluster has limited resources.
YuniKorn can track applications' actual status and dynamically adjusting 
queues(resource, priority, etc) based on the SLO of applications.

Here is initial proposal Feature 2 
[https://docs.google.com/document/d/1c8tzmEgl32o6_0eDxQ1ZiMRcD-1QRvdu-3PABNZQX30/edit#heading=h.nkwxxppm7zbd]
 

We had some discussions with the community during meetup on 04/03/24.

Next step: we will create a more detailed design doc and review with the 
community. 

cc [~wilfreds] [~ccondit]


> [Umbrella] YuniKorn: Dynamically Adjust Queue to Ensure App Service Level 
> Objectives (SLO)
> ------------------------------------------------------------------------------------------
>
>                 Key: YUNIKORN-2535
>                 URL: https://issues.apache.org/jira/browse/YUNIKORN-2535
>             Project: Apache YuniKorn
>          Issue Type: New Feature
>          Components: core - common, core - scheduler, shim - kubernetes
>            Reporter: Rainie Li
>            Assignee: Rainie Li
>            Priority: Major
>
> We want to enhance YuniKorn's capability to guarantee Service Level 
> Objectives (SLOs) for critical applications, particularly in scenarios where 
> the cluster has limited resources. 
> YuniKorn will be responsible for monitoring the real-time status of 
> applications and dynamically adjust the queues (including resources, 
> priority, etc.) aligned with the applications' SLOs.
> Here is initial proposal 
> [https://docs.google.com/document/d/1c8tzmEgl32o6_0eDxQ1ZiMRcD-1QRvdu-3PABNZQX30/edit#heading=h.nkwxxppm7zbd]
>  feature 2.
> We had some discussions with the community during meetup on 04/03/24 and 
> received valuable insights and suggestions.
> Moving forward, we will create a more detailed design doc. Once ready, it 
> will be open for review and further discussions with the community.
> cc [~wilfreds] [~ccondit]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to