[ 
https://issues.apache.org/jira/browse/AMBARI-20071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15872420#comment-15872420
 ] 

Aravindan Vijayan commented on AMBARI-20071:
--------------------------------------------

After patch.

{code}
2017-02-17 19:49:28,224 INFO  timeline.HadoopTimelineMetricsSink 
(HadoopTimelineMetricsSink.java:init(80)) - Initializing Timeline metrics sink.
2017-02-17 19:49:28,224 INFO  timeline.HadoopTimelineMetricsSink 
(HadoopTimelineMetricsSink.java:init(98)) - Identified hostname = 
avijayan-ams-1.openstacklocal, serviceName = datanode
2017-02-17 19:49:28,380 INFO  timeline.HadoopTimelineMetricsSink 
(HadoopTimelineMetricsSink.java:init(123)) - No suitable collector found.
2017-02-17 19:49:28,393 INFO  impl.MetricsSinkAdapter 
(MetricsSinkAdapter.java:start(206)) - Sink timeline started
2017-02-17 19:50:38,518 INFO  timeline.HadoopTimelineMetricsSink 
(AbstractTimelineMetricsSink.java:emitMetrics(230)) - No live collector to send 
metrics to. Metrics to be sent will be discarded. This message will be skipped 
for the next 20 times.
2017-02-17 19:53:28,712 INFO  timeline.HadoopTimelineMetricsSink 
(AbstractTimelineMetricsSink.java:emitMetrics(230)) - No live collector to send 
metrics to. Metrics to be sent will be discarded. This message will be skipped 
for the next 20 times.
2017-02-17 19:57:18,524 INFO  timeline.HadoopTimelineMetricsSink 
(AbstractTimelineMetricsSink.java:emitMetrics(230)) - No live collector to send 
metrics to. Metrics to be sent will be discarded. This message will be skipped 
for the next 20 times.
{code}

> Hadoop metrics sink prints lots of logs if collector is unavailable
> -------------------------------------------------------------------
>
>                 Key: AMBARI-20071
>                 URL: https://issues.apache.org/jira/browse/AMBARI-20071
>             Project: Ambari
>          Issue Type: Bug
>          Components: ambari-metrics
>    Affects Versions: 2.5.0
>            Reporter: Aravindan Vijayan
>            Assignee: Aravindan Vijayan
>            Priority: Critical
>             Fix For: 2.5.0
>
>
> during debugging, found the metrics sink prints lots of such messages in RM 
> log every second, which makes logs rotates and purge fast.
> {code}
> 2017-02-16 19:05:48,896 INFO  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419)) - 
> Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live. 
> Removing it from list of know live collector hosts : []
> 2017-02-16 19:05:48,896 WARN  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:findPreferredCollectHost(399)) - Couldn't 
> find any live collectors. Returning null
> 2017-02-16 19:05:48,896 WARN  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:emitMetrics(227)) - No live collector to 
> send metrics to. Metrics to be sent will be discarded.
> 2017-02-16 19:05:50,901 WARN  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:findLiveCollectorHostsFromKnownCollector(476))
>  - Unable to connect to collector to find live nodes.
> 2017-02-16 19:05:50,901 INFO  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419)) - 
> Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live. 
> Removing it from list of know live collector hosts : []
> 2017-02-16 19:05:50,901 WARN  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:findPreferredCollectHost(399)) - Couldn't 
> find any live collectors. Returning null
> 2017-02-16 19:05:50,902 WARN  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:emitMetrics(227)) - No live collector to 
> send metrics to. Metrics to be sent will be discarded.
> 2017-02-16 19:06:48,896 WARN  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:findLiveCollectorHostsFromKnownCollector(476))
>  - Unable to connect to collector to find live nodes.
> 2017-02-16 19:06:48,897 INFO  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419)) - 
> Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live. 
> Removing it from list of know live collector hosts : []
> 2017-02-16 19:06:48,897 INFO  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:findPreferredCollectHost(359)) - No live 
> collectors from configuration. Requesting zookeeper...
> 2017-02-16 19:06:48,988 INFO  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:findPreferredCollectHost(369)) - No new 
> collector was found from Zookeeper. Will not request zookeeper for 120000 
> millis
> 2017-02-16 19:06:48,989 INFO  
> availability.MetricSinkWriteShardHostnameHashingStrategy 
> (MetricSinkWriteShardHostnameHashingStrategy.java:findCollectorShard(42)) - 
> Calculated collector shard ctr-e129-1487033772569-2546-01-000004.hwx.site 
> based on hostname: ctr-e129-1487033772569-2546-01-000003.hwx.site
> 2017-02-16 19:06:59,004 INFO  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:emitMetrics(217)) - Removing collector 
> ctr-e129-1487033772569-2546-01-000004.hwx.site from allKnownLiveCollectors.
> 2017-02-16 19:07:01,009 WARN  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:findLiveCollectorHostsFromKnownCollector(476))
>  - Unable to connect to collector to find live nodes.
> 2017-02-16 19:07:01,010 INFO  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419)) - 
> Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live. 
> Removing it from list of know live collector hosts : []
> 2017-02-16 19:07:01,010 WARN  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:findPreferredCollectHost(399)) - Couldn't 
> find any live collectors. Returning null
> 2017-02-16 19:07:01,010 WARN  timeline.HadoopTimelineMetricsSink 
> (AbstractTimelineMetricsSink.java:emitMetrics(227)) - No live collector to 
> send metrics to. Metrics to be sent will be discarded.
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to