[ 
https://issues.apache.org/jira/browse/YARN-11965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18088415#comment-18088415
 ] 

ASF GitHub Bot commented on YARN-11965:
---------------------------------------

zhengchenyu opened a new pull request, #8547:
URL: https://github.com/apache/hadoop/pull/8547

   ### Description of PR
   
   When node labels are enabled, different labels represent separate resource 
pools. However, the RM REST API `/ws/v1/cluster/metrics` currently computes 
fields such as `totalMB` and `totalVirtualCores` from the default partition 
only. As a result, resources in non-default partitions are not visible to 
external resource management systems, which may wrongly conclude that the 
cluster has no available capacity after new labels are added.
   
   The API should expose cluster resource metrics across all partitions and 
provide partition-level metrics so clients can distinguish capacity and usage 
by node label.
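
A minimal sketch of the client-side problem, under stated assumptions: the `clusterMetrics` payload below mirrors the shape of the existing response with made-up numbers, while the per-partition dictionary, the `gpu` label, and the helper names are hypothetical illustrations, not the format introduced by this PR.

```python
import json
import urllib.request


def fetch_cluster_metrics(rm_url):
    """Fetch /ws/v1/cluster/metrics from an RM web address (rm_url is an assumption)."""
    with urllib.request.urlopen(rm_url + "/ws/v1/cluster/metrics", timeout=10) as resp:
        return json.load(resp)["clusterMetrics"]


# Sample payload shaped like the current response; the values are invented.
# Here the default partition is fully used, so availableMB is 0.
DEFAULT_ONLY = json.loads(
    '{"clusterMetrics": {"totalMB": 8192, "availableMB": 0,'
    ' "totalVirtualCores": 8, "availableVirtualCores": 0}}'
)["clusterMetrics"]

# Hypothetical per-partition view ("" stands for the default partition).
# The layout and the "gpu" label are assumptions for illustration only.
PARTITIONS = {
    "": {"totalMB": 8192, "availableMB": 0},
    "gpu": {"totalMB": 16384, "availableMB": 12288},
}


def has_capacity(metrics, needed_mb):
    """Return True if the reported available memory covers the request."""
    return metrics["availableMB"] >= needed_mb


def aggregate(partitions):
    """Sum every numeric field across all partitions."""
    total = {}
    for metrics in partitions.values():
        for key, value in metrics.items():
            total[key] = total.get(key, 0) + value
    return total


print(has_capacity(DEFAULT_ONLY, 1024))           # False
print(has_capacity(aggregate(PARTITIONS), 1024))  # True
print(has_capacity(PARTITIONS["gpu"], 1024))      # True
```

A client that reads only the default-partition figures sees no free memory, while the cluster-wide sum and the per-label view both show spare capacity in the labeled partition.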
   
   ### How was this patch tested?
   
   Unit tests, plus testing on a cluster running an internal Hadoop version.
   
   ### For code changes:
   
   - [ ] Does the title of this PR start with the corresponding JIRA issue id 
(e.g. 'HADOOP-17799. Your PR title ...')?
   - [ ] Object storage: have the integration tests been executed and the 
endpoint declared according to the connector-specific documentation?
   - [ ] If adding new dependencies to the code, are these dependencies 
licensed in a way that is compatible for inclusion under [ASF 
2.0](http://www.apache.org/legal/resolved.html#category-a)?
   - [ ] If applicable, have you updated the `LICENSE`, `LICENSE-binary`, 
`NOTICE-binary` files?
   
   ### AI Tooling
   
   If an AI tool was used:
   
   - [ ] The PR includes the phrase "Contains content generated by <tool>"
         where <tool> is the name of the AI tool used.
   - [ ] My use of AI contributions follows the ASF legal policy
         https://www.apache.org/legal/generative-tooling.html




> Support partition-aware resource metrics in RM cluster metrics REST API
> -----------------------------------------------------------------------
>
>                 Key: YARN-11965
>                 URL: https://issues.apache.org/jira/browse/YARN-11965
>             Project: Hadoop YARN
>          Issue Type: Improvement
>            Reporter: Chenyu Zheng
>            Assignee: Chenyu Zheng
>            Priority: Major
>
> When node labels are enabled, different labels represent separate resource 
> pools. However, the RM REST API `/ws/v1/cluster/metrics` currently exposes 
> fields such as totalMB and totalVirtualCores based on the default partition 
> only. As a result, resources from non-default partitions are not visible to 
> external resource management systems, which may incorrectly determine that 
> the cluster has no available capacity after new labels are added.
> The API should expose cluster resource metrics across all partitions and 
> provide partition-level metrics so clients can distinguish capacity and usage 
> by node label.



