Stephan Ewen created FLINK-6440:
-----------------------------------

             Summary: Noisy logs from metric fetcher
                 Key: FLINK-6440
                 URL: https://issues.apache.org/jira/browse/FLINK-6440
             Project: Flink
          Issue Type: Bug
          Components: Webfrontend
    Affects Versions: 1.3.0
            Reporter: Stephan Ewen
            Priority: Critical
             Fix For: 1.3.0

When TaskManagers fail, the web frontend in the JobManager starts logging the exception below every few seconds.

I labeled this as critical because the log gets flooded with noise, which makes debugging in such a situation much harder.

{code}
2017-05-03 19:37:07,823 WARN  org.apache.flink.runtime.webmonitor.metrics.MetricFetcher - Fetching metrics failed.
akka.pattern.AskTimeoutException: Ask timed out on [Actor[akka.tcp://flink@herman:52175/user/MetricQueryService_136f717a6b91e248282cb2937d22088c]] after [10000 ms]
    at akka.pattern.PromiseActorRef$$anonfun$1.apply$mcV$sp(AskSupport.scala:334)
    at akka.actor.Scheduler$$anon$7.run(Scheduler.scala:117)
    at scala.concurrent.Future$InternalCallbackExecutor$.scala$concurrent$Future$InternalCallbackExecutor$$unbatchedExecute(Future.scala:694)
    at scala.concurrent.Future$InternalCallbackExecutor$.execute(Future.scala:691)
    at akka.actor.LightArrayRevolverScheduler$TaskHolder.executeTask(Scheduler.scala:474)
    at akka.actor.LightArrayRevolverScheduler$$anon$8.executeBucket$1(Scheduler.scala:425)
    at akka.actor.LightArrayRevolverScheduler$$anon$8.nextTick(Scheduler.scala:429)
    at akka.actor.LightArrayRevolverScheduler$$anon$8.run(Scheduler.scala:381)
    at java.lang.Thread.run(Thread.java:745)
{code}

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)