[ 
https://issues.apache.org/jira/browse/FLINK-885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14026644#comment-14026644
 ] 

Robert Metzger commented on FLINK-885:
--------------------------------------

I came across a similar issue while testing Flink on Amazon EMR today:
I cancelled the large test job.
{code}
06/10/2014 16:27:44:    Join(T3 Join: Iteration input) (7/8) switched to FAILED
java.lang.ArithmeticException: / by zero
        at 
eu.stratosphere.pact.runtime.hash.CompactingHashTable$HashTableProber.getMatchFor(CompactingHashTable.java:1089)
        at 
eu.stratosphere.pact.runtime.task.JoinWithSolutionSetSecondDriver.run(JoinWithSolutionSetSecondDriver.java:134)
        at 
eu.stratosphere.pact.runtime.task.RegularPactTask.run(RegularPactTask.java:505)
        at 
eu.stratosphere.pact.runtime.iterative.task.AbstractIterativePactTask.run(AbstractIterativePactTask.java:132)
        at 
eu.stratosphere.pact.runtime.iterative.task.IterationIntermediatePactTask.run(IterationIntermediatePactTask.java:84)
        at 
eu.stratosphere.pact.runtime.task.RegularPactTask.invoke(RegularPactTask.java:370)
        at 
eu.stratosphere.nephele.execution.RuntimeEnvironment.run(RuntimeEnvironment.java:284)
        at java.lang.Thread.run(Thread.java:744)
{code}

Do you think I should open a new Issue for that?

> Task cancellation fails in TempBarrier.close()
> ----------------------------------------------
>
>                 Key: FLINK-885
>                 URL: https://issues.apache.org/jira/browse/FLINK-885
>             Project: Flink
>          Issue Type: Bug
>            Reporter: GitHub Import
>              Labels: github-import
>             Fix For: pre-apache
>
>
> Another issue in line with 
> https://github.com/stratosphere/stratosphere/issues/154 and 
> https://github.com/stratosphere/stratosphere/issues/281:
> I got this issue while testing the testjob on the cluster with the current 
> 0.5-rc3.
> ```
> 42 ERROR eu.stratosphere.pact.runtime.task.RegularPactTask             - 
> Error closing temp barrier for input 1
> java.lang.IndexOutOfBoundsException: Index: 1, Size: 0
>         at java.util.ArrayList.rangeCheck(ArrayList.java:635)
>         at java.util.ArrayList.remove(ArrayList.java:474)
>         at 
> eu.stratosphere.pact.runtime.io.SpillingBuffer.moveAll(SpillingBuffer.java:188)
>         at 
> eu.stratosphere.pact.runtime.io.SpillingBuffer.close(SpillingBuffer.java:150)
>         at 
> eu.stratosphere.pact.runtime.task.TempBarrier.close(TempBarrier.java:121)
>         at 
> eu.stratosphere.pact.runtime.task.RegularPactTask.closeLocalStrategiesAndCaches(RegularPactTask.java:615)
>         at 
> eu.stratosphere.pact.runtime.task.RegularPactTask.cancel(RegularPactTask.java:404)
>         at 
> eu.stratosphere.nephele.taskmanager.runtime.RuntimeTask.cancelOrKillExecution(RuntimeTask.java:213)
>         at 
> eu.stratosphere.nephele.taskmanager.runtime.RuntimeTask.cancelExecution(RuntimeTask.java:174)
>         at 
> eu.stratosphere.nephele.taskmanager.TaskManager$2.run(TaskManager.java:592)
>         at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>         at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>         at java.lang.Thread.run(Thread.java:745)
> 10:43:38,741 INFO  eu.stratosphere.nephele.execution.ExecutionStateTransition 
>    - TM: ExecutionState set from RUNNING to CANCELING for task Join(T4 Join: 
> All order keys) (136/208)
> 10:43:38,741 INFO  eu.stratosphere.nephele.execution.ExecutionStateTransition 
>    - TM: ExecutionState set from CANCELING to CANCELED for task Join(T4 Join: 
> All order keys) (133/208)
> 10:43:38,738 INFO  eu.stratosphere.nephele.execution.ExecutionStateTransition 
>    - TM: ExecutionState set from CANCELING to CANCELED for task Join(T4 Join: 
> All order keys) (131/208)
> 10:43:38,738 ERROR eu.stratosphere.pact.runtime.task.RegularPactTask          
>    - Error closing temp barrier for input 1
> java.lang.IndexOutOfBoundsException: Index: 1, Size: 0
>         at java.util.ArrayList.rangeCheck(ArrayList.java:635)
>         at java.util.ArrayList.remove(ArrayList.java:474)
>         at 
> eu.stratosphere.pact.runtime.io.SpillingBuffer.moveAll(SpillingBuffer.java:188)
>         at 
> eu.stratosphere.pact.runtime.io.SpillingBuffer.close(SpillingBuffer.java:150)
>         at 
> eu.stratosphere.pact.runtime.task.TempBarrier.close(TempBarrier.java:121)
>         at 
> eu.stratosphere.pact.runtime.task.RegularPactTask.closeLocalStrategiesAndCaches(RegularPactTask.java:615)
>         at 
> eu.stratosphere.pact.runtime.task.RegularPactTask.cancel(RegularPactTask.java:404)
>         at 
> eu.stratosphere.nephele.taskmanager.runtime.RuntimeTask.cancelOrKillExecution(RuntimeTask.java:213)
>         at 
> eu.stratosphere.nephele.taskmanager.runtime.RuntimeTask.cancelExecution(RuntimeTask.java:174)
>         at 
> eu.stratosphere.nephele.taskmanager.TaskManager$2.run(TaskManager.java:592)
>         at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>         at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>         at java.lang.Thread.run(Thread.java:745)
> 10:43:38,756 INFO  eu.stratosphere.nephele.execution.ExecutionStateTransition 
>    - TM: ExecutionState set from CANCELING to CANCELED for task Join(T4 Join: 
> All order keys) (136/208)
> ```
> ---------------- Imported from GitHub ----------------
> Url: https://github.com/stratosphere/stratosphere/issues/885
> Created by: [rmetzger|https://github.com/rmetzger]
> Labels: bug, runtime, 
> Created at: Thu May 29 10:49:52 CEST 2014
> State: open



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to