Tassapol Athiapinya created TEZ-878:
---------------------------------------
Summary: Run TEZ-823
testInputFailureCausesRerunOfTwoVerticesWithoutExit in real cluster results in
NPE at TaskScheduler.assignContainer
Key: TEZ-878
URL: https://issues.apache.org/jira/browse/TEZ-878
Project: Apache Tez
Issue Type: Bug
Affects Versions: 0.3.0
Reporter: Tassapol Athiapinya
Priority: Critical
Fix For: 0.3.0
I have single-node cluster. I run tez DAG similar to
testInputFailureCausesRerunOfTwoVerticesWithoutExit unit test added in TEZ-823.
DAG does not exit. There is an NPE in Tez AM logs:
{code}
2014-02-21 23:42:58,123 INFO [AsyncDispatcher event handler]
org.apache.tez.dag.app.dag.impl.TaskImpl: task_1393025990757_0002_1_00_000000
Task Transitioned from SUCCEEDED to SCHEDULED
2014-02-21 23:42:58,124 ERROR [DelayedContainerManager]
org.apache.hadoop.yarn.YarnUncaughtExceptionHandler: Thread
Thread[DelayedContainerManager,5,main] threw an Exception.
java.lang.NullPointerException
at
org.apache.tez.dag.app.rm.TaskScheduler.assignContainer(TaskScheduler.java:1084)
at
org.apache.tez.dag.app.rm.TaskScheduler.access$500(TaskScheduler.java:83)
at
org.apache.tez.dag.app.rm.TaskScheduler$ContainerAssigner.doBookKeepingForAssignedContainer(TaskScheduler.java:1326)
at
org.apache.tez.dag.app.rm.TaskScheduler$NodeLocalContainerAssigner.assignReUsedContainer(TaskScheduler.java:1351)
at
org.apache.tez.dag.app.rm.TaskScheduler.assignReUsedContainerWithLocation(TaskScheduler.java:1231)
at
org.apache.tez.dag.app.rm.TaskScheduler.assignReUsedContainersWithLocation(TaskScheduler.java:1193)
at
org.apache.tez.dag.app.rm.TaskScheduler.tryAssignReUsedContainers(TaskScheduler.java:505)
at
org.apache.tez.dag.app.rm.TaskScheduler.access$1000(TaskScheduler.java:83)
at
org.apache.tez.dag.app.rm.TaskScheduler$DelayedContainerManager.doAssignAll(TaskScheduler.java:1554)
at
org.apache.tez.dag.app.rm.TaskScheduler$DelayedContainerManager.run(TaskScheduler.java:1456)
{code}
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)