[ 
https://issues.apache.org/jira/browse/AMBARI-5607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13994004#comment-13994004
 ] 

Dmytro Sen commented on AMBARI-5607:
------------------------------------

The main cause of the bug is hided in the default rrdtool/rrdfetch 
configuration and Ganglia's gmetad.conf 

If we don't set "s" parameter to rrd.py (rrdtool fetch, in fact) is uses 
default option values "s=now-1day&e=now". For the specified period circular 
buffer #2 contains more complete and detailed data than buffer #1. Setting 
default value between 6 minutes and 1 hour.

The patch is attached to the Review Board

> Yarn Nodemanager Metrics only update every few minutes
> ------------------------------------------------------
>
>                 Key: AMBARI-5607
>                 URL: https://issues.apache.org/jira/browse/AMBARI-5607
>             Project: Ambari
>          Issue Type: Bug
>    Affects Versions: 1.5.1
>         Environment: sles11sp1
>            Reporter: Michael Harp
>            Assignee: Dmytro Sen
>
> Yarn Nodemanager Metrics take far too long between updates.
> To demonstrate:
> Run Terasort or anything that runs mapreduce:
> {code}
> hdfs dfs -mkdir -p benchmarks/terasort
> hadoop jar /usr/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar teragen 
> -Dmapred.map.tasks=72 -Dmapred.reduce.tasks=36  1000000 
> benchmarks/terasort/input
> hadoop jar /usr/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar terasort 
> -Dmapred.map.tasks=72 -Dmapred.reduce.tasks=36 benchmarks/terasort/input 
> benchmarks/terasort/output
> hdfs dfs -rm -R -skipTrash benchmarks/terasort
> {code}
> Then repeatedly probe the API at:
> {noformat}https://<server>:8081/api/v1/clusters/c1/services/YARN/components/NODEMANAGER?fields=host_components/metrics/yarn{noformat}
> It usually takes 2-3 minutes to see the metrics update, very repeatable.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to