[ 
https://issues.apache.org/jira/browse/YUNIKORN-829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chenya Zhang closed YUNIKORN-829.
---------------------------------
    Resolution: Won't Do

> Produce metrics on queue-level resource utilization
> ---------------------------------------------------
>
>                 Key: YUNIKORN-829
>                 URL: https://issues.apache.org/jira/browse/YUNIKORN-829
>             Project: Apache YuniKorn
>          Issue Type: Sub-task
>          Components: core - scheduler, shim - kubernetes
>            Reporter: Chaoran Yu
>            Priority: Major
>
> YuniKorn already has metrics on the resources requested/allocated for each 
> queue. But we have no visibility into how much of the allocated resources are 
> actually being used. Take Spark as an example, an under-optimized job may 
> request 1 TB of total executor memory but the actual processing logic only 
> uses 100 GB. This has the consequence that other jobs might not be able to 
> fit in the queue. Having a metric that shows the real utilization will help 
> members of a queue better understand their job characteristics and optimize 
> the jobs.
> K8s metrics server has metrics on real utilization. YK may be able to perform 
> some aggregations to arrive at the stats at the queue level. This is a 
> k8s-specific solution though.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to