[ https://issues.apache.org/jira/browse/AMBARI-20071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Aravindan Vijayan updated AMBARI-20071: --------------------------------------- Attachment: AMBARI-20071.patch > Hadoop metrics sink prints lots of logs if collector is unavailable > ------------------------------------------------------------------- > > Key: AMBARI-20071 > URL: https://issues.apache.org/jira/browse/AMBARI-20071 > Project: Ambari > Issue Type: Bug > Components: ambari-metrics > Affects Versions: 2.5.0 > Reporter: Aravindan Vijayan > Assignee: Aravindan Vijayan > Priority: Critical > Fix For: 2.5.0 > > Attachments: AMBARI-20071.patch > > > during debugging, found the metrics sink prints lots of such messages in RM > log every second, which makes logs rotates and purge fast. > {code} > 2017-02-16 19:05:48,896 INFO timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419)) - > Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live. > Removing it from list of know live collector hosts : [] > 2017-02-16 19:05:48,896 WARN timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:findPreferredCollectHost(399)) - Couldn't > find any live collectors. Returning null > 2017-02-16 19:05:48,896 WARN timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:emitMetrics(227)) - No live collector to > send metrics to. Metrics to be sent will be discarded. > 2017-02-16 19:05:50,901 WARN timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:findLiveCollectorHostsFromKnownCollector(476)) > - Unable to connect to collector to find live nodes. > 2017-02-16 19:05:50,901 INFO timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419)) - > Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live. > Removing it from list of know live collector hosts : [] > 2017-02-16 19:05:50,901 WARN timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:findPreferredCollectHost(399)) - Couldn't > find any live collectors. Returning null > 2017-02-16 19:05:50,902 WARN timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:emitMetrics(227)) - No live collector to > send metrics to. Metrics to be sent will be discarded. > 2017-02-16 19:06:48,896 WARN timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:findLiveCollectorHostsFromKnownCollector(476)) > - Unable to connect to collector to find live nodes. > 2017-02-16 19:06:48,897 INFO timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419)) - > Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live. > Removing it from list of know live collector hosts : [] > 2017-02-16 19:06:48,897 INFO timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:findPreferredCollectHost(359)) - No live > collectors from configuration. Requesting zookeeper... > 2017-02-16 19:06:48,988 INFO timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:findPreferredCollectHost(369)) - No new > collector was found from Zookeeper. Will not request zookeeper for 120000 > millis > 2017-02-16 19:06:48,989 INFO > availability.MetricSinkWriteShardHostnameHashingStrategy > (MetricSinkWriteShardHostnameHashingStrategy.java:findCollectorShard(42)) - > Calculated collector shard ctr-e129-1487033772569-2546-01-000004.hwx.site > based on hostname: ctr-e129-1487033772569-2546-01-000003.hwx.site > 2017-02-16 19:06:59,004 INFO timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:emitMetrics(217)) - Removing collector > ctr-e129-1487033772569-2546-01-000004.hwx.site from allKnownLiveCollectors. > 2017-02-16 19:07:01,009 WARN timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:findLiveCollectorHostsFromKnownCollector(476)) > - Unable to connect to collector to find live nodes. > 2017-02-16 19:07:01,010 INFO timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419)) - > Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live. > Removing it from list of know live collector hosts : [] > 2017-02-16 19:07:01,010 WARN timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:findPreferredCollectHost(399)) - Couldn't > find any live collectors. Returning null > 2017-02-16 19:07:01,010 WARN timeline.HadoopTimelineMetricsSink > (AbstractTimelineMetricsSink.java:emitMetrics(227)) - No live collector to > send metrics to. Metrics to be sent will be discarded. > {code} -- This message was sent by Atlassian JIRA (v6.3.15#6346)