[
https://issues.apache.org/jira/browse/YARN-10456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17211439#comment-17211439
]
Eric Payne commented on YARN-10456:
-----------------------------------
We use hadoop-metrics2.properties to set up Simon metrics aggregation.
The format of the output of the RM aggregation metrics begins with:
{noformat}
EPOCH CONTEXT.RECORDNAME ...
{noformat}
Here, {{CONTEXT=yarn}} and {{RECORDNAME=QueueMetrics}} for both
{{QueueMetrics}} and {{PartitionQueueMetrics}}. This is incorrect and is
confusing the Simon aggregator and causing the numbers for several metrics to
be incorrect.
The {{RECORDNAME}} is coming from the {{MetricsInfo}} object in the
{{MetricsRegistry}} in each {{*Metrics}} class. In this case,
{{PartitionQueueMetrics}} is a child of the {{QueueMetrics}} class, and when
{{PartitionQueueMetrics}} is constructed, the {{MetricsInfo}} name for
{{PartitionQueueMetrics}} is assigned "{{QueueMetrics}}" instead of
"{{PartitionQueueMetrics}}".
> RM PartitionQueueMetrics records are named QueueMetrics in Simon metrics
> registry
> ---------------------------------------------------------------------------------
>
> Key: YARN-10456
> URL: https://issues.apache.org/jira/browse/YARN-10456
> Project: Hadoop YARN
> Issue Type: Improvement
> Components: resourcemanager
> Affects Versions: 3.3.0, 3.2.1, 3.1.4, 2.10.1
> Reporter: Eric Payne
> Assignee: Eric Payne
> Priority: Major
>
> Several queue metrics (such as AppsRunning, PendingContainers, etc.) stopped
> working after we upgraded to 2.10.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]