[ 
https://issues.apache.org/jira/browse/YARN-10456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17211439#comment-17211439
 ] 

Eric Payne commented on YARN-10456:
-----------------------------------

We use hadoop-metrics2.properties to set up Simon metrics aggregation.

Each record in the RM aggregation metrics output begins with:
{noformat}
EPOCH CONTEXT.RECORDNAME ...
{noformat}
Here, {{CONTEXT=yarn}} and {{RECORDNAME=QueueMetrics}} for both 
{{QueueMetrics}} and {{PartitionQueueMetrics}} records. This is incorrect: the 
shared record name confuses the Simon aggregator, which makes the numbers for 
several metrics wrong.
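
To make the collision concrete, here is a minimal sketch of splitting that record prefix into its parts. {{RecordPrefixSketch}} is a hypothetical helper written for this comment, not part of Hadoop or Simon, and it assumes the fields are whitespace-separated with a single dot between context and record name.

```java
// Hypothetical helper (not a Hadoop or Simon class): parses the
// "EPOCH CONTEXT.RECORDNAME ..." prefix described above.
class RecordPrefixSketch {
    final long epoch;
    final String context;
    final String recordName;

    RecordPrefixSketch(String line) {
        // Assumption: fields are separated by whitespace; anything after
        // the second field is ignored here.
        String[] fields = line.trim().split("\\s+", 3);
        if (fields.length < 2) {
            throw new IllegalArgumentException("expected EPOCH CONTEXT.RECORDNAME: " + line);
        }
        int dot = fields[1].indexOf('.');
        if (dot < 0) {
            throw new IllegalArgumentException("expected CONTEXT.RECORDNAME: " + fields[1]);
        }
        this.epoch = Long.parseLong(fields[0]);
        this.context = fields[1].substring(0, dot);
        this.recordName = fields[1].substring(dot + 1);
    }

    // The part of the prefix that identifies the record type.
    String key() {
        return context + "." + recordName;
    }
}
```

With a made-up epoch, {{new RecordPrefixSketch("1602288000 yarn.QueueMetrics").key()}} gives {{yarn.QueueMetrics}}, and that is the key both record types currently produce.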

The {{RECORDNAME}} comes from the {{MetricsInfo}} object in the 
{{MetricsRegistry}} of each {{*Metrics}} class. In this case, 
{{PartitionQueueMetrics}} is a subclass of {{QueueMetrics}}, and when a 
{{PartitionQueueMetrics}} is constructed, its {{MetricsInfo}} name is set to 
"{{QueueMetrics}}" instead of "{{PartitionQueueMetrics}}".
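
The inheritance effect can be sketched with simplified stand-in classes. None of the classes below are the real Hadoop ones; {{RegistrySketch}}, {{QueueMetricsSketch}}, and the two subclasses are hypothetical and only model a record name fixed in a parent constructor. The protected constructor is one possible way to let a subclass supply its own name, not necessarily the fix that will be used for this issue.

```java
// Simplified stand-ins, NOT the real Hadoop classes: they only model how a
// record name chosen in a parent constructor is inherited by a subclass.
class RegistrySketch {
    private final String recordName;

    RegistrySketch(String recordName) {
        this.recordName = recordName;
    }

    String recordName() {
        return recordName;
    }
}

class QueueMetricsSketch {
    protected final RegistrySketch registry;

    QueueMetricsSketch() {
        // The name is hard-coded here, so a subclass that relies on this
        // constructor reports itself as "QueueMetrics" too.
        this("QueueMetrics");
    }

    // Hypothetical alternative: the subclass passes its own record name.
    protected QueueMetricsSketch(String recordName) {
        this.registry = new RegistrySketch(recordName);
    }
}

// Models the reported behavior: implicitly calls the no-arg parent constructor.
class InheritedNameSketch extends QueueMetricsSketch {
}

// Models a subclass that names its own records.
class OwnNameSketch extends QueueMetricsSketch {
    OwnNameSketch() {
        super("PartitionQueueMetrics");
    }
}

class RecordNameDemo {
    public static void main(String[] args) {
        System.out.println(new InheritedNameSketch().registry.recordName()); // QueueMetrics
        System.out.println(new OwnNameSketch().registry.recordName());       // PartitionQueueMetrics
    }
}
```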

> RM PartitionQueueMetrics records are named QueueMetrics in Simon metrics 
> registry
> ---------------------------------------------------------------------------------
>
>                 Key: YARN-10456
>                 URL: https://issues.apache.org/jira/browse/YARN-10456
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: resourcemanager
>    Affects Versions: 3.3.0, 3.2.1, 3.1.4, 2.10.1
>            Reporter: Eric Payne
>            Assignee: Eric Payne
>            Priority: Major
>
> Several queue metrics (such as AppsRunning, PendingContainers, etc.) stopped 
> working after we upgraded to 2.10.



