[
https://issues.apache.org/jira/browse/YARN-1166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Zhijie Shen updated YARN-1166:
------------------------------
Attachment: YARN-1166.6.patch
Given YARN-1493 separates app and app-attempt events, I created a new patch
which not only change 'appsFailed' from MutableGaugeInt to MutableCounterInt,
but also bind the change of the counter with the correct event:
1. SubmitApp: increment appsSubmitted
2. SubmitAppAttempt: increment appsPending
3. RunAppAttempt: decrement appsPending, increment appsRunning
4. FinishAppAttempt: decrement appsRunning
5. FinishApp: increment appsCompleted/appsKilled/appsFailed
1,2 and 5 are binded to the app related events, and are always increasing,
while 3 and 4 are binded to the app-attempt related events, and can increase
and decrease.
> YARN 'appsFailed' metric should be of type 'counter'
> ----------------------------------------------------
>
> Key: YARN-1166
> URL: https://issues.apache.org/jira/browse/YARN-1166
> Project: Hadoop YARN
> Issue Type: Bug
> Components: resourcemanager
> Affects Versions: 2.1.0-beta
> Reporter: Srimanth Gunturi
> Assignee: Zhijie Shen
> Priority: Blocker
> Attachments: YARN-1166.2.patch, YARN-1166.3.patch, YARN-1166.4.patch,
> YARN-1166.5.patch, YARN-1166.6.patch, YARN-1166.patch
>
>
> Currently in YARN's queue metrics, the cumulative metric 'appsFailed' is of
> type 'guage' - which means the exact value will be reported.
> All other cumulative queue metrics (AppsSubmitted, AppsCompleted, AppsKilled)
> are all of type 'counter' - meaning Ganglia will use slope to provide deltas
> between time-points.
> To be consistent, AppsFailed metric should also be of type 'counter'.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)