[
https://issues.apache.org/jira/browse/FLINK-4041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15339637#comment-15339637
]
ASF GitHub Bot commented on FLINK-4041:
---------------------------------------
GitHub user mxm opened a pull request:
https://github.com/apache/flink/pull/2137
[FLINK-4041] reduce log level for JobManager->ResourceManager timeouts
These timeouts are recoverable and do not need to be logged at error level
with an exception.
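
The idea can be illustrated with a small, self-contained sketch. It is not the
code from this pull request: the class name AskTimeoutLogging, the helpers
levelFor and awaitReply, the use of java.util.logging, and the choice of INFO
as the reduced level are all assumptions made for illustration. A timed-out
request is reported as a one-line message without a stack trace, while any
other failure is still logged as an error with its exception.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical illustration only; none of these names exist in Flink.
class AskTimeoutLogging {
    private static final Logger LOG =
            Logger.getLogger(AskTimeoutLogging.class.getName());

    // Timeouts are treated as recoverable and get a lower level (INFO is an
    // assumption); every other failure keeps the error level (SEVERE).
    public static Level levelFor(Throwable failure) {
        return (failure instanceof TimeoutException) ? Level.INFO : Level.SEVERE;
    }

    // Waits for a reply. Returns the reply, or null if the request failed.
    public static String awaitReply(CompletableFuture<String> reply, long timeoutMillis) {
        try {
            return reply.get(timeoutMillis, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            // Recoverable: log the message only, without the stack trace.
            LOG.log(levelFor(e), "Request timed out after " + timeoutMillis + " ms");
            return null;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            LOG.log(Level.SEVERE, "Interrupted while waiting for reply", e);
            return null;
        } catch (ExecutionException e) {
            // Unexpected failure: keep the exception in the log record.
            LOG.log(levelFor(e.getCause()), "Request failed", e.getCause());
            return null;
        }
    }

    public static void main(String[] args) {
        // A future that never completes stands in for an unanswered request.
        String timedOut = awaitReply(new CompletableFuture<String>(), 50);
        String answered = awaitReply(CompletableFuture.completedFuture("registered"), 50);
        System.out.println("timedOut=" + timedOut + ", answered=" + answered);
    }
}
```

Keeping the level decision in its own method makes the policy easy to change
or test without touching the code that waits for the reply.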
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/mxm/flink FLINK-4041
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/flink/pull/2137.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #2137
----
commit f7ac8262fecae3de7520d6a7fb071d69e3001bb1
Author: Maximilian Michels
Date: 2016-06-20T14:49:26Z
[FLINK-4041] reduce log level for JobManager->ResourceManager timeouts
These timeouts can be recovered and don't have to be logged as error
with an Exception.
----
> Failure while asking ResourceManager for RegisterResource
> ---------------------------------------------------------
>
> Key: FLINK-4041
> URL: https://issues.apache.org/jira/browse/FLINK-4041
> Project: Flink
> Issue Type: Bug
> Components: ResourceManager
> Affects Versions: 1.1.0
> Reporter: Robert Metzger
> Labels: test-stability
>
> In this build
> (https://s3.amazonaws.com/archive.travis-ci.org/jobs/136372462/log.txt), I
> got the following YARN Test failure:
> {code}
> 2016-06-09 10:21:42,336 INFO akka.remote.RemoteActorRefProvider$RemotingTerminator - Shutting down remote daemon.
> 2016-06-09 10:21:42,336 INFO akka.remote.RemoteActorRefProvider$RemotingTerminator - Remote daemon shut down; proceeding with flushing remote transports.
> 2016-06-09 10:21:42,355 INFO akka.remote.RemoteActorRefProvider$RemotingTerminator - Remoting shut down.
> 2016-06-09 10:21:42,376 ERROR org.apache.flink.yarn.YarnJobManager - Failure while asking ResourceManager for RegisterResource
> akka.pattern.AskTimeoutException: Ask timed out on [Actor[akka://flink/user/$c#1255104255]] after [10000 ms]
> at akka.pattern.PromiseActorRef$$anonfun$1.apply$mcV$sp(AskSupport.scala:333)
> at akka.actor.Scheduler$$anon$7.run(Scheduler.scala:117)
> at akka.actor.LightArrayRevolverScheduler$TaskHolder.run(Scheduler.scala:476)
> at akka.actor.LightArrayRevolverScheduler$$anonfun$close$1.apply(Scheduler.scala:282)
> at akka.actor.LightArrayRevolverScheduler$$anonfun$close$1.apply(Scheduler.scala:281)
> at scala.collection.Iterator$class.foreach(Iterator.scala:742)
> at scala.collection.AbstractIterator.foreach(Iterator.scala:1194)
> at scala.collection.IterableLike$class.foreach(IterableLike.scala:72)
> at scala.collection.AbstractIterable.foreach(Iterable.scala:54)
> at akka.actor.LightArrayRevolverScheduler.close(Scheduler.scala:280)
> at akka.actor.ActorSystemImpl.stopScheduler(ActorSystem.scala:688)
> at akka.actor.ActorSystemImpl$$anonfun$liftedTree2$1$1.apply$mcV$sp(ActorSystem.scala:617)
> at akka.actor.ActorSystemImpl$$anonfun$liftedTree2$1$1.apply(ActorSystem.scala:617)
> at akka.actor.ActorSystemImpl$$anonfun$liftedTree2$1$1.apply(ActorSystem.scala:617)
> at akka.actor.ActorSystemImpl$$anon$3.run(ActorSystem.scala:641)
> at akka.actor.ActorSystemImpl$TerminationCallbacks$$anonfun$run$1.runNext$1(ActorSystem.scala:808)
> at akka.actor.ActorSystemImpl$TerminationCallbacks$$anonfun$run$1.apply$mcV$sp(ActorSystem.scala:811)
> at akka.actor.ActorSystemImpl$TerminationCallbacks$$anonfun$run$1.apply(ActorSystem.scala:804)
> at akka.actor.ActorSystemImpl$TerminationCallbacks$$anonfun$run$1.apply(ActorSystem.scala:804)
> at akka.util.ReentrantGuard.withGuard(LockUtil.scala:15)
> at akka.actor.ActorSystemImpl$TerminationCallbacks.run(ActorSystem.scala:804)
> at akka.actor.ActorSystemImpl$$anonfun$terminationCallbacks$1.apply(ActorSystem.scala:638)
> at akka.actor.ActorSystemImpl$$anonfun$terminationCallbacks$1.apply(ActorSystem.scala:638)
> at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:32)
> at akka.dispatch.BatchingExecutor$Batch$$anonfun$run$1.processBatch$1(BatchingExecutor.scala:67)
> at akka.dispatch.BatchingExecutor$Batch$$anonfun$run$1.apply$mcV$sp(BatchingExecutor.scala:82)
> at akka.dispatch.BatchingExecutor$Batch$$anonfun$run$1.apply(BatchingExecutor.scala:59)
> at akka.dispatch.BatchingExecutor$Batch$$anonfun$run$1.apply(BatchingExecutor.scala:59)
> at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:72)
> at akka.dispatch.BatchingExecutor$Batch.run(BatchingExecutor.scala:58)
> at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41)
> at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401)
> at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
> at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
> at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
> at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> 2016-06-09 10:21:42,376 INFO org.apache.flink.yarn.YarnJobManager - Shutdown completed. Stopping JVM.
> 2016-06-09 10:21:42,377 INFO org.apache.flink.runtime.webmonitor.StackTraceSampleCoordinator - Shutting down stack trace sample coordinator.
> {code}