[jira] [Commented] (FLINK-3994) Instable KNNITSuite
[ https://issues.apache.org/jira/browse/FLINK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15309941#comment-15309941 ]

ASF GitHub Bot commented on FLINK-3994:
---

Github user mxm commented on the pull request:

    https://github.com/apache/flink/pull/2056

    Thanks for fixing!

> Instable KNNITSuite
> ---
>
>                 Key: FLINK-3994
>                 URL: https://issues.apache.org/jira/browse/FLINK-3994
>             Project: Flink
>          Issue Type: Bug
>          Components: Machine Learning Library, Tests
>    Affects Versions: 1.1.0
>            Reporter: Chiwan Park
>            Assignee: Chiwan Park
>            Priority: Critical
>              Labels: test-stability
>             Fix For: 1.1.0
>
>
> KNNITSuite fails in Travis-CI with following error:
> {code}
> org.apache.flink.runtime.client.JobExecutionException: Job execution failed.
>   at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply$mcV$sp(JobManager.scala:806)
>   at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:752)
>   at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:752)
>   at scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)
>   at scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)
>   at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41)
>   at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401)
>   at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
>   at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
>   at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
>   ...
>   Cause: java.io.IOException: Insufficient number of network buffers: required 32, but only 4 available. The total number of network buffers is currently set to 2048. You can increase this number by setting the configuration key 'taskmanager.network.numberOfBuffers'.
>   at org.apache.flink.runtime.io.network.buffer.NetworkBufferPool.createBufferPool(NetworkBufferPool.java:196)
>   at org.apache.flink.runtime.io.network.NetworkEnvironment.registerTask(NetworkEnvironment.java:327)
>   at org.apache.flink.runtime.taskmanager.Task.run(Task.java:497)
>   at java.lang.Thread.run(Thread.java:745)
>   ...
> {code}
> https://s3.amazonaws.com/archive.travis-ci.org/jobs/134064237/log.txt
> https://s3.amazonaws.com/archive.travis-ci.org/jobs/134064236/log.txt
> https://s3.amazonaws.com/archive.travis-ci.org/jobs/134064235/log.txt
> https://s3.amazonaws.com/archive.travis-ci.org/jobs/134052961/log.txt

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (FLINK-3994) Instable KNNITSuite
[ https://issues.apache.org/jira/browse/FLINK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15309020#comment-15309020 ]

ASF GitHub Bot commented on FLINK-3994:
---

Github user asfgit closed the pull request at:

    https://github.com/apache/flink/pull/2056
[jira] [Commented] (FLINK-3994) Instable KNNITSuite
[ https://issues.apache.org/jira/browse/FLINK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15309017#comment-15309017 ]

ASF GitHub Bot commented on FLINK-3994:
---

Github user chiwanpark commented on the pull request:

    https://github.com/apache/flink/pull/2056

    Merging...
[jira] [Commented] (FLINK-3994) Instable KNNITSuite
[ https://issues.apache.org/jira/browse/FLINK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15307904#comment-15307904 ]

ASF GitHub Bot commented on FLINK-3994:
---

Github user chiwanpark commented on the pull request:

    https://github.com/apache/flink/pull/2056

    Thanks for clarifying, @StephanEwen! I'll merge this after Travis succeeds.
[jira] [Commented] (FLINK-3994) Instable KNNITSuite
[ https://issues.apache.org/jira/browse/FLINK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15307899#comment-15307899 ]

ASF GitHub Bot commented on FLINK-3994:
---

Github user StephanEwen commented on the pull request:

    https://github.com/apache/flink/pull/2056

    There is actually a problem with the way the Scala Tests are written: The code in the class that is outside the "it should" clauses is executed before the "before" function. That is why the tests go against a LocalEnvironment, rather than the test context environment.

    So Chiwan's patch will actually fix it, only for different reasons than initially expected.
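
The ordering problem described in the comment above can be illustrated without ScalaTest. The following is a dependency-free Java analogue, not Flink or ScalaTest code: every name in it (`InitOrderSketch`, `contextEnv`, `before`, `EagerSuite`, `LazySuite`) is hypothetical. It assumes only that a value captured in the class body is evaluated at construction time, before any setup hook runs, while a value fetched inside the test body is evaluated afterwards.

```java
class InitOrderSketch {
    // Hypothetical stand-in for the globally registered "context" environment.
    static String contextEnv = "local";

    // Hypothetical stand-in for the setup hook a test base class runs before a test.
    static void before() {
        contextEnv = "test-cluster";
    }

    // Environment captured in the class body: evaluated when the suite is constructed.
    static class EagerSuite {
        final String env = contextEnv;

        String runTest() {
            return env;
        }
    }

    // Environment fetched inside the test body: evaluated after before() has run.
    static class LazySuite {
        String runTest() {
            return contextEnv;
        }
    }

    static String runEager() {
        contextEnv = "local";
        EagerSuite suite = new EagerSuite(); // class body runs here
        before();                            // setup hook runs afterwards
        return suite.runTest();
    }

    static String runLazy() {
        contextEnv = "local";
        LazySuite suite = new LazySuite();
        before();
        return suite.runTest();
    }

    public static void main(String[] args) {
        System.out.println("eager: " + runEager()); // prints "eager: local"
        System.out.println("lazy:  " + runLazy());  // prints "lazy:  test-cluster"
    }
}
```

The eager variant ends up with the default environment even though the hook later installs the test one, which matches the symptom described: tests running against a local environment instead of the test context environment.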
[jira] [Commented] (FLINK-3994) Instable KNNITSuite
[ https://issues.apache.org/jira/browse/FLINK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15307859#comment-15307859 ]

ASF GitHub Bot commented on FLINK-3994:
---

Github user StephanEwen commented on the pull request:

    https://github.com/apache/flink/pull/2056

    It is a bit tricky to understand what happens when with these tests using stacked traits...
[jira] [Commented] (FLINK-3994) Instable KNNITSuite
[ https://issues.apache.org/jira/browse/FLINK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15307852#comment-15307852 ]

ASF GitHub Bot commented on FLINK-3994:
---

Github user StephanEwen commented on the pull request:

    https://github.com/apache/flink/pull/2056

    The limit should be set to 4 by the test tools already.

    I think the issue may be that, prior to Chiwan's change, the Execution Environment was acquired before the context was properly set by the test tool superclass.
[jira] [Commented] (FLINK-3994) Instable KNNITSuite
[ https://issues.apache.org/jira/browse/FLINK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15307800#comment-15307800 ]

ASF GitHub Bot commented on FLINK-3994:
---

Github user chiwanpark commented on the pull request:

    https://github.com/apache/flink/pull/2056

    Thanks for the guidance, @mxm. I'll set an upper limit of 4 for the parallelism.
[jira] [Commented] (FLINK-3994) Instable KNNITSuite
[ https://issues.apache.org/jira/browse/FLINK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15307721#comment-15307721 ]

ASF GitHub Bot commented on FLINK-3994:
---

Github user mxm commented on the pull request:

    https://github.com/apache/flink/pull/2056

    I'm not sure whether the changes fix the problem. I think you were lucky with the build machine on Travis :)

    The issue on Travis is that the `ExecutionEnvironment` defaults to using the number of cores reported by `Runtime.getRuntime().availableProcessors()` as task slots. On Travis this can be `32`. We need to adjust the number of network buffers correctly. In addition, setting an upper limit for the parallelism for tests would also make sense.
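
The suggestion to cap test parallelism can be sketched as follows. This is an illustrative, dependency-free Java snippet, not Flink's test tooling: `TestParallelism`, `MAX_TEST_PARALLELISM`, and `cappedParallelism` are hypothetical names, and the cap of 4 is an assumed value chosen for illustration. The only real API used is the JDK call `Runtime.getRuntime().availableProcessors()`.

```java
class TestParallelism {
    // Assumed upper bound for tests; the value 4 is illustrative.
    static final int MAX_TEST_PARALLELISM = 4;

    // Clamp a reported core count into the range [1, MAX_TEST_PARALLELISM].
    static int cappedParallelism(int reportedCores) {
        return Math.max(1, Math.min(reportedCores, MAX_TEST_PARALLELISM));
    }

    public static void main(String[] args) {
        // The reported value depends on the machine, so it is printed but not asserted.
        int cores = Runtime.getRuntime().availableProcessors();
        System.out.println("reported cores:   " + cores);
        System.out.println("test parallelism: " + cappedParallelism(cores));

        // A CI machine reporting 32 cores would be clamped to the cap.
        System.out.println("32 cores capped:  " + cappedParallelism(32));
    }
}
```

With a fixed cap, the resources a test needs no longer depend on how many cores the build machine happens to report.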
[jira] [Commented] (FLINK-3994) Instable KNNITSuite
[ https://issues.apache.org/jira/browse/FLINK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15307691#comment-15307691 ]

ASF GitHub Bot commented on FLINK-3994:
---

GitHub user chiwanpark opened a pull request:

    https://github.com/apache/flink/pull/2056

    [FLINK-3994] [ml, tests] Fix flaky KNN integration tests

    This PR is related to the flaky KNN integration tests. The problem is caused by sharing `ExecutionEnvironment` between test cases. I'm not sure about the exact reason. This PR makes each test case have its own `ExecutionEnvironment`. Tests on my local machine and on my Travis-CI [1] pass with this PR.

    I have some doubts because this is not a fundamental fix for the problem. AFAIK, and as @StephanEwen said, sharing `ExecutionEnvironment` should be supported. Additionally, `mvn clean verify` has passed without this PR on my local machine. If there are any other opinions, please leave a comment.

    [1]: https://travis-ci.org/chiwanpark/flink/builds/134104491

    P.S. We need to rewrite the commit message.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/chiwanpark/flink hotfix-ml-test

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/flink/pull/2056.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #2056

commit a47ae8481bcbac2c490386089ee6b1e740f3a1f4
Author: Chiwan Park
Date:   2016-05-31T08:50:05Z

    [hotfix] [ml] Fix flaky KNN integration tests
[jira] [Commented] (FLINK-3994) Instable KNNITSuite
[ https://issues.apache.org/jira/browse/FLINK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15307607#comment-15307607 ]

Maximilian Michels commented on FLINK-3994:
---

Thanks [~chiwanpark]!
[jira] [Commented] (FLINK-3994) Instable KNNITSuite
[ https://issues.apache.org/jira/browse/FLINK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15307537#comment-15307537 ] Chiwan Park commented on FLINK-3994: I'm looking into this issue. I've assigned myself. > Instable KNNITSuite > --- > > Key: FLINK-3994 > URL: https://issues.apache.org/jira/browse/FLINK-3994 > Project: Flink > Issue Type: Bug > Components: Machine Learning Library, Tests >Affects Versions: 1.1.0 >Reporter: Chiwan Park >Assignee: Chiwan Park >Priority: Critical > Labels: test-stability > > KNNITSuite fails in Travis-CI with following error: > {code} > org.apache.flink.runtime.client.JobExecutionException: Job execution failed. > at > org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply$mcV$sp(JobManager.scala:806) > at > org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:752) > at > org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:752) > at > scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24) > at > scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24) > at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41) > at > akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401) > at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) > at > scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253) > at > scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346) > ... > Cause: java.io.IOException: Insufficient number of network buffers: > required 32, but only 4 available. The total number of network buffers is > currently set to 2048. You can increase this number by setting the > configuration key 'taskmanager.network.numberOfBuffers'. 
> at > org.apache.flink.runtime.io.network.buffer.NetworkBufferPool.createBufferPool(NetworkBufferPool.java:196) > at > org.apache.flink.runtime.io.network.NetworkEnvironment.registerTask(NetworkEnvironment.java:327) > at org.apache.flink.runtime.taskmanager.Task.run(Task.java:497) > at java.lang.Thread.run(Thread.java:745) > ... > {code} > https://s3.amazonaws.com/archive.travis-ci.org/jobs/134064237/log.txt > https://s3.amazonaws.com/archive.travis-ci.org/jobs/134064236/log.txt > https://s3.amazonaws.com/archive.travis-ci.org/jobs/134064235/log.txt > https://s3.amazonaws.com/archive.travis-ci.org/jobs/134052961/log.txt -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (FLINK-3994) Instable KNNITSuite
[ https://issues.apache.org/jira/browse/FLINK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15307535#comment-15307535 ] Ufuk Celebi commented on FLINK-3994: Thanks for looking into this. Are you working on this [~chiwanpark] or [~mxm]? Both of you commented on the mailing list. Would be great if one of you assigns himself so we can fix this soon. If you don't have time, I can also look into it. > Instable KNNITSuite > --- > > Key: FLINK-3994 > URL: https://issues.apache.org/jira/browse/FLINK-3994 > Project: Flink > Issue Type: Bug > Components: Machine Learning Library, Tests >Affects Versions: 1.1.0 >Reporter: Chiwan Park >Priority: Critical > Labels: test-stability > > KNNITSuite fails in Travis-CI with following error: > {code} > org.apache.flink.runtime.client.JobExecutionException: Job execution failed. > at > org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply$mcV$sp(JobManager.scala:806) > at > org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:752) > at > org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:752) > at > scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24) > at > scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24) > at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41) > at > akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401) > at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) > at > scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253) > at > scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346) > ... > Cause: java.io.IOException: Insufficient number of network buffers: > required 32, but only 4 available.
The total number of network buffers is > currently set to 2048. You can increase this number by setting the > configuration key 'taskmanager.network.numberOfBuffers'. > at > org.apache.flink.runtime.io.network.buffer.NetworkBufferPool.createBufferPool(NetworkBufferPool.java:196) > at > org.apache.flink.runtime.io.network.NetworkEnvironment.registerTask(NetworkEnvironment.java:327) > at org.apache.flink.runtime.taskmanager.Task.run(Task.java:497) > at java.lang.Thread.run(Thread.java:745) > ... > {code} > https://s3.amazonaws.com/archive.travis-ci.org/jobs/134064237/log.txt > https://s3.amazonaws.com/archive.travis-ci.org/jobs/134064236/log.txt > https://s3.amazonaws.com/archive.travis-ci.org/jobs/134064235/log.txt > https://s3.amazonaws.com/archive.travis-ci.org/jobs/134052961/log.txt -- This message was sent by Atlassian JIRA (v6.3.4#6332)