[
https://issues.apache.org/jira/browse/AMBARI-20071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15876500#comment-15876500
]
Hudson commented on AMBARI-20071:
---------------------------------
SUCCESS: Integrated in Jenkins build Ambari-branch-2.5 #1046 (See
[https://builds.apache.org/job/Ambari-branch-2.5/1046/])
AMBARI-20071 : Hadoop metrics sink prints lots of logs if collector is
(avijayan:
[http://git-wip-us.apache.org/repos/asf?p=ambari.git&a=commit&h=4279738af80a4b448ba24906e2144b5cd0806b63])
* (edit)
ambari-metrics/ambari-metrics-common/src/main/java/org/apache/hadoop/metrics2/sink/timeline/AbstractTimelineMetricsSink.java
* (edit)
ambari-metrics/ambari-metrics-hadoop-sink/src/main/java/org/apache/hadoop/metrics2/sink/timeline/HadoopTimelineMetricsSink.java
> Hadoop metrics sink prints lots of logs if collector is unavailable
> -------------------------------------------------------------------
>
> Key: AMBARI-20071
> URL: https://issues.apache.org/jira/browse/AMBARI-20071
> Project: Ambari
> Issue Type: Bug
> Components: ambari-metrics
> Affects Versions: 2.5.0
> Reporter: Aravindan Vijayan
> Assignee: Aravindan Vijayan
> Priority: Critical
> Fix For: 2.5.0
>
> Attachments: AMBARI-20071.patch
>
>
> The metrics sink prints lots of such messages in Hadoop daemons log every
> second, which makes logs rotates and purge fast.
> {code}
> 2017-02-16 19:05:48,896 INFO timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419)) -
> Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live.
> Removing it from list of know live collector hosts : []
> 2017-02-16 19:05:48,896 WARN timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:findPreferredCollectHost(399)) - Couldn't
> find any live collectors. Returning null
> 2017-02-16 19:05:48,896 WARN timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:emitMetrics(227)) - No live collector to
> send metrics to. Metrics to be sent will be discarded.
> 2017-02-16 19:05:50,901 WARN timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:findLiveCollectorHostsFromKnownCollector(476))
> - Unable to connect to collector to find live nodes.
> 2017-02-16 19:05:50,901 INFO timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419)) -
> Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live.
> Removing it from list of know live collector hosts : []
> 2017-02-16 19:05:50,901 WARN timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:findPreferredCollectHost(399)) - Couldn't
> find any live collectors. Returning null
> 2017-02-16 19:05:50,902 WARN timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:emitMetrics(227)) - No live collector to
> send metrics to. Metrics to be sent will be discarded.
> 2017-02-16 19:06:48,896 WARN timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:findLiveCollectorHostsFromKnownCollector(476))
> - Unable to connect to collector to find live nodes.
> 2017-02-16 19:06:48,897 INFO timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419)) -
> Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live.
> Removing it from list of know live collector hosts : []
> 2017-02-16 19:06:48,897 INFO timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:findPreferredCollectHost(359)) - No live
> collectors from configuration. Requesting zookeeper...
> 2017-02-16 19:06:48,988 INFO timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:findPreferredCollectHost(369)) - No new
> collector was found from Zookeeper. Will not request zookeeper for 120000
> millis
> 2017-02-16 19:06:48,989 INFO
> availability.MetricSinkWriteShardHostnameHashingStrategy
> (MetricSinkWriteShardHostnameHashingStrategy.java:findCollectorShard(42)) -
> Calculated collector shard ctr-e129-1487033772569-2546-01-000004.hwx.site
> based on hostname: ctr-e129-1487033772569-2546-01-000003.hwx.site
> 2017-02-16 19:06:59,004 INFO timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:emitMetrics(217)) - Removing collector
> ctr-e129-1487033772569-2546-01-000004.hwx.site from allKnownLiveCollectors.
> 2017-02-16 19:07:01,009 WARN timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:findLiveCollectorHostsFromKnownCollector(476))
> - Unable to connect to collector to find live nodes.
> 2017-02-16 19:07:01,010 INFO timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419)) -
> Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live.
> Removing it from list of know live collector hosts : []
> 2017-02-16 19:07:01,010 WARN timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:findPreferredCollectHost(399)) - Couldn't
> find any live collectors. Returning null
> 2017-02-16 19:07:01,010 WARN timeline.HadoopTimelineMetricsSink
> (AbstractTimelineMetricsSink.java:emitMetrics(227)) - No live collector to
> send metrics to. Metrics to be sent will be discarded.
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)