[ https://issues.apache.org/jira/browse/FLINK-885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14026644#comment-14026644 ]
Robert Metzger commented on FLINK-885:
--------------------------------------
I came across a similar issue today while testing Flink on Amazon EMR. I cancelled the large test job and got:
{code}
06/10/2014 16:27:44: Join(T3 Join: Iteration input) (7/8) switched to FAILED
java.lang.ArithmeticException: / by zero
    at eu.stratosphere.pact.runtime.hash.CompactingHashTable$HashTableProber.getMatchFor(CompactingHashTable.java:1089)
    at eu.stratosphere.pact.runtime.task.JoinWithSolutionSetSecondDriver.run(JoinWithSolutionSetSecondDriver.java:134)
    at eu.stratosphere.pact.runtime.task.RegularPactTask.run(RegularPactTask.java:505)
    at eu.stratosphere.pact.runtime.iterative.task.AbstractIterativePactTask.run(AbstractIterativePactTask.java:132)
    at eu.stratosphere.pact.runtime.iterative.task.IterationIntermediatePactTask.run(IterationIntermediatePactTask.java:84)
    at eu.stratosphere.pact.runtime.task.RegularPactTask.invoke(RegularPactTask.java:370)
    at eu.stratosphere.nephele.execution.RuntimeEnvironment.run(RuntimeEnvironment.java:284)
    at java.lang.Thread.run(Thread.java:744)
{code}
Do you think I should open a new issue for that?
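
For illustration, here is a minimal sketch of one way a `/ by zero` can surface in a hash-table probe after a cancel. It assumes the probe picks its bucket with `hash % numBuckets` and that releasing the table resets `numBuckets` to 0. Both are assumptions and were not checked against `CompactingHashTable`. `ToyHashTable`, `close()` and `bucketFor()` are hypothetical names used only for this example.

```java
public class ProbeAfterClose {

    /** Hypothetical stand-in for a hash table; NOT the real CompactingHashTable. */
    static final class ToyHashTable {
        private int numBuckets;

        ToyHashTable(int numBuckets) {
            this.numBuckets = numBuckets;
        }

        /** Assumed behavior: releasing the table resets the bucket count to zero. */
        void close() {
            numBuckets = 0;
        }

        /** Integer remainder by zero throws ArithmeticException with message "/ by zero". */
        int bucketFor(int hash) {
            return (hash & 0x7fffffff) % numBuckets;
        }
    }

    public static void main(String[] args) {
        ToyHashTable table = new ToyHashTable(8);
        System.out.println("bucket before close: " + table.bucketFor(42));

        // In a real task this could be triggered by a cancel call from another thread.
        table.close();

        try {
            table.bucketFor(42);
        } catch (ArithmeticException e) {
            System.out.println("after close: " + e.getMessage());
        }
    }
}
```

If something like this is happening here, the exception would be a side effect of the cancel racing with the still-running probe, but that would need to be confirmed in the source.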
> Task cancellation fails in TempBarrier.close()
> ----------------------------------------------
>
> Key: FLINK-885
> URL: https://issues.apache.org/jira/browse/FLINK-885
> Project: Flink
> Issue Type: Bug
> Reporter: GitHub Import
> Labels: github-import
> Fix For: pre-apache
>
>
> Another issue in line with
> https://github.com/stratosphere/stratosphere/issues/154 and
> https://github.com/stratosphere/stratosphere/issues/281:
> I got this error while running the test job on the cluster with the current 0.5-rc3.
> ```
> 42 ERROR eu.stratosphere.pact.runtime.task.RegularPactTask - Error closing temp barrier for input 1
> java.lang.IndexOutOfBoundsException: Index: 1, Size: 0
>     at java.util.ArrayList.rangeCheck(ArrayList.java:635)
>     at java.util.ArrayList.remove(ArrayList.java:474)
>     at eu.stratosphere.pact.runtime.io.SpillingBuffer.moveAll(SpillingBuffer.java:188)
>     at eu.stratosphere.pact.runtime.io.SpillingBuffer.close(SpillingBuffer.java:150)
>     at eu.stratosphere.pact.runtime.task.TempBarrier.close(TempBarrier.java:121)
>     at eu.stratosphere.pact.runtime.task.RegularPactTask.closeLocalStrategiesAndCaches(RegularPactTask.java:615)
>     at eu.stratosphere.pact.runtime.task.RegularPactTask.cancel(RegularPactTask.java:404)
>     at eu.stratosphere.nephele.taskmanager.runtime.RuntimeTask.cancelOrKillExecution(RuntimeTask.java:213)
>     at eu.stratosphere.nephele.taskmanager.runtime.RuntimeTask.cancelExecution(RuntimeTask.java:174)
>     at eu.stratosphere.nephele.taskmanager.TaskManager$2.run(TaskManager.java:592)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>     at java.lang.Thread.run(Thread.java:745)
> 10:43:38,741 INFO eu.stratosphere.nephele.execution.ExecutionStateTransition - TM: ExecutionState set from RUNNING to CANCELING for task Join(T4 Join: All order keys) (136/208)
> 10:43:38,741 INFO eu.stratosphere.nephele.execution.ExecutionStateTransition - TM: ExecutionState set from CANCELING to CANCELED for task Join(T4 Join: All order keys) (133/208)
> 10:43:38,738 INFO eu.stratosphere.nephele.execution.ExecutionStateTransition - TM: ExecutionState set from CANCELING to CANCELED for task Join(T4 Join: All order keys) (131/208)
> 10:43:38,738 ERROR eu.stratosphere.pact.runtime.task.RegularPactTask - Error closing temp barrier for input 1
> java.lang.IndexOutOfBoundsException: Index: 1, Size: 0
>     at java.util.ArrayList.rangeCheck(ArrayList.java:635)
>     at java.util.ArrayList.remove(ArrayList.java:474)
>     at eu.stratosphere.pact.runtime.io.SpillingBuffer.moveAll(SpillingBuffer.java:188)
>     at eu.stratosphere.pact.runtime.io.SpillingBuffer.close(SpillingBuffer.java:150)
>     at eu.stratosphere.pact.runtime.task.TempBarrier.close(TempBarrier.java:121)
>     at eu.stratosphere.pact.runtime.task.RegularPactTask.closeLocalStrategiesAndCaches(RegularPactTask.java:615)
>     at eu.stratosphere.pact.runtime.task.RegularPactTask.cancel(RegularPactTask.java:404)
>     at eu.stratosphere.nephele.taskmanager.runtime.RuntimeTask.cancelOrKillExecution(RuntimeTask.java:213)
>     at eu.stratosphere.nephele.taskmanager.runtime.RuntimeTask.cancelExecution(RuntimeTask.java:174)
>     at eu.stratosphere.nephele.taskmanager.TaskManager$2.run(TaskManager.java:592)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>     at java.lang.Thread.run(Thread.java:745)
> 10:43:38,756 INFO eu.stratosphere.nephele.execution.ExecutionStateTransition - TM: ExecutionState set from CANCELING to CANCELED for task Join(T4 Join: All order keys) (136/208)
> ```
> ---------------- Imported from GitHub ----------------
> Url: https://github.com/stratosphere/stratosphere/issues/885
> Created by: [rmetzger|https://github.com/rmetzger]
> Labels: bug, runtime
> Created at: Thu May 29 10:49:52 CEST 2014
> State: open