[ https://issues.apache.org/jira/browse/FLINK-3281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15114659#comment-15114659 ]
Chengxiang Li commented on FLINK-3281: -------------------------------------- [~fsander], thanks for finding this, i would work on it. > IndexOutOfBoundsException when range-partitioning empty DataSet > ---------------------------------------------------------------- > > Key: FLINK-3281 > URL: https://issues.apache.org/jira/browse/FLINK-3281 > Project: Flink > Issue Type: Bug > Components: Distributed Runtime, Local Runtime > Reporter: Fridtjof Sander > > Code: > {code} > import org.apache.flink.api.scala._ > object RangePartitionOnEmptyDataSet { > def main(args:Array[String]) = { > val env = ExecutionEnvironment.getExecutionEnvironment > env > .fromCollection(Seq[Tuple1[String]]()) > .partitionByRange(0) > .collect() > } > } > {code} > Output: > {noformat} > 01/24/2016 16:24:36 Job execution switched to status RUNNING. > 01/24/2016 16:24:36 DataSource (at > RangePartitionOnEmptyDataSet$.main(RangePartitionOnEmptyDataSet.scala:9) > (org.apache.flink.api.java.io.CollectionInputFormat))(1/1) switched to > SCHEDULED > 01/24/2016 16:24:36 DataSource (at > RangePartitionOnEmptyDataSet$.main(RangePartitionOnEmptyDataSet.scala:9) > (org.apache.flink.api.java.io.CollectionInputFormat))(1/1) switched to > DEPLOYING > 01/24/2016 16:24:36 DataSource (at > RangePartitionOnEmptyDataSet$.main(RangePartitionOnEmptyDataSet.scala:9) > (org.apache.flink.api.java.io.CollectionInputFormat))(1/1) switched to > RUNNING > 01/24/2016 16:24:36 RangePartition: LocalSample(1/1) switched to SCHEDULED > 01/24/2016 16:24:36 RangePartition: LocalSample(1/1) switched to DEPLOYING > 01/24/2016 16:24:36 DataSource (at > RangePartitionOnEmptyDataSet$.main(RangePartitionOnEmptyDataSet.scala:9) > (org.apache.flink.api.java.io.CollectionInputFormat))(1/1) switched to > FINISHED > 01/24/2016 16:24:36 RangePartition: PreparePartition(1/1) switched to > SCHEDULED > 01/24/2016 16:24:36 RangePartition: PreparePartition(1/1) switched to > DEPLOYING > 01/24/2016 16:24:36 RangePartition: LocalSample(1/1) switched to RUNNING > 01/24/2016 16:24:36 RangePartition: PreparePartition(1/1) switched to > RUNNING > 01/24/2016 16:24:36 RangePartition: GlobalSample(1/1) switched to SCHEDULED > 01/24/2016 16:24:36 RangePartition: GlobalSample(1/1) switched to DEPLOYING > 01/24/2016 16:24:36 RangePartition: LocalSample(1/1) switched to FINISHED > 01/24/2016 16:24:36 RangePartition: GlobalSample(1/1) switched to RUNNING > 01/24/2016 16:24:36 RangePartition: Histogram(1/1) switched to SCHEDULED > 01/24/2016 16:24:36 RangePartition: Histogram(1/1) switched to DEPLOYING > 01/24/2016 16:24:36 RangePartition: GlobalSample(1/1) switched to FINISHED > 01/24/2016 16:24:36 RangePartition: Histogram(1/1) switched to RUNNING > 01/24/2016 16:24:37 RangePartition: Histogram(1/1) switched to FAILED > java.lang.IndexOutOfBoundsException: Index: 0, Size: 0 > at java.util.ArrayList.rangeCheck(ArrayList.java:653) > at java.util.ArrayList.get(ArrayList.java:429) > at > org.apache.flink.runtime.operators.udf.RangeBoundaryBuilder.mapPartition(RangeBoundaryBuilder.java:66) > at > org.apache.flink.runtime.operators.MapPartitionDriver.run(MapPartitionDriver.java:98) > at org.apache.flink.runtime.operators.BatchTask.run(BatchTask.java:486) > at > org.apache.flink.runtime.operators.BatchTask.invoke(BatchTask.java:351) > at org.apache.flink.runtime.taskmanager.Task.run(Task.java:561) > at java.lang.Thread.run(Thread.java:745) > 01/24/2016 16:24:37 Job execution switched to status FAILING. > java.lang.IndexOutOfBoundsException: Index: 0, Size: 0 > at java.util.ArrayList.rangeCheck(ArrayList.java:653) > at java.util.ArrayList.get(ArrayList.java:429) > at > org.apache.flink.runtime.operators.udf.RangeBoundaryBuilder.mapPartition(RangeBoundaryBuilder.java:66) > at > org.apache.flink.runtime.operators.MapPartitionDriver.run(MapPartitionDriver.java:98) > at org.apache.flink.runtime.operators.BatchTask.run(BatchTask.java:486) > at > org.apache.flink.runtime.operators.BatchTask.invoke(BatchTask.java:351) > at org.apache.flink.runtime.taskmanager.Task.run(Task.java:561) > at java.lang.Thread.run(Thread.java:745) > 01/24/2016 16:24:37 RangePartition: PreparePartition(1/1) switched to > CANCELING > 01/24/2016 16:24:37 RangePartition: Partition(1/4) switched to CANCELED > 01/24/2016 16:24:37 RangePartition: Partition(2/4) switched to CANCELED > 01/24/2016 16:24:37 RangePartition: Partition(3/4) switched to CANCELED > 01/24/2016 16:24:37 RangePartition: Partition(4/4) switched to CANCELED > 01/24/2016 16:24:37 CHAIN Partition -> FlatMap (FlatMap at > collect(DataSet.scala:542))(1/4) switched to CANCELED > 01/24/2016 16:24:37 CHAIN Partition -> FlatMap (FlatMap at > collect(DataSet.scala:542))(2/4) switched to CANCELED > 01/24/2016 16:24:37 CHAIN Partition -> FlatMap (FlatMap at > collect(DataSet.scala:542))(3/4) switched to CANCELED > 01/24/2016 16:24:37 CHAIN Partition -> FlatMap (FlatMap at > collect(DataSet.scala:542))(4/4) switched to CANCELED > 01/24/2016 16:24:37 RangePartition: PreparePartition(1/1) switched to > CANCELED > 01/24/2016 16:24:37 DataSink > (org.apache.flink.api.java.io.DiscardingOutputFormat@525b461a)(1/4) switched > to CANCELED > 01/24/2016 16:24:37 DataSink > (org.apache.flink.api.java.io.DiscardingOutputFormat@525b461a)(2/4) switched > to CANCELED > 01/24/2016 16:24:37 DataSink > (org.apache.flink.api.java.io.DiscardingOutputFormat@525b461a)(3/4) switched > to CANCELED > 01/24/2016 16:24:37 DataSink > (org.apache.flink.api.java.io.DiscardingOutputFormat@525b461a)(4/4) switched > to CANCELED > 01/24/2016 16:24:37 Job execution switched to status FAILED. > Exception in thread "main" > org.apache.flink.runtime.client.JobExecutionException: Job execution failed. > at > org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$5.apply$mcV$sp(JobManager.scala:570) > at > org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$5.apply(JobManager.scala:516) > at > org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$5.apply(JobManager.scala:516) > at > scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24) > at > scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24) > at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41) > at > akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401) > at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) > at > scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339) > at > scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) > at > scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107) > Caused by: java.lang.IndexOutOfBoundsException: Index: 0, Size: 0 > at java.util.ArrayList.rangeCheck(ArrayList.java:653) > at java.util.ArrayList.get(ArrayList.java:429) > at > org.apache.flink.runtime.operators.udf.RangeBoundaryBuilder.mapPartition(RangeBoundaryBuilder.java:66) > at > org.apache.flink.runtime.operators.MapPartitionDriver.run(MapPartitionDriver.java:98) > at org.apache.flink.runtime.operators.BatchTask.run(BatchTask.java:486) > at > org.apache.flink.runtime.operators.BatchTask.invoke(BatchTask.java:351) > at org.apache.flink.runtime.taskmanager.Task.run(Task.java:561) > at java.lang.Thread.run(Thread.java:745) > Process finished with exit code 1 > {noformat} > The access happens in {{RangeBoundaryBuilder.java:66}}. > Sadly, I don't know enough about this to fix it in reasonable time. > [~chengxiang li] maybe? -- This message was sent by Atlassian JIRA (v6.3.4#6332)