[
https://issues.apache.org/jira/browse/TEZ-1592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14153653#comment-14153653
]
Bikas Saha commented on TEZ-1592:
---------------------------------
This likely not the same issue because in this jiras case the app master should
die with a stack trace before launching the tasks.
Here tasks have launched and are waiting for input so it could be a different
issue with shuffle task starting before their inputs are fully determined. That
is intermittent and not fixed in 0.5.1. It will be fixed in 0.5.2. 0.5.1 is
mainly to make the APIs out in a public release so that dynamic partition
pruning can get committed in Hive.
The stacks are not useful. I would recommend opening a new bug and attaching
the AM and task logs. Also it would be great if you can provide the DAG like in
https://issues.apache.org/jira/browse/TEZ-1612.
> Vertex should wait for all initializers to finish before moving to INITED
> state
> -------------------------------------------------------------------------------
>
> Key: TEZ-1592
> URL: https://issues.apache.org/jira/browse/TEZ-1592
> Project: Apache Tez
> Issue Type: Bug
> Reporter: Siddharth Seth
> Assignee: Siddharth Seth
> Priority: Blocker
> Fix For: 0.5.1
>
> Attachments: DagAppMasterJstack.txt, TEZ-1592.1.txt,
> TezChild1Jstack.txt, TezChild2Jstack.txt, TezChild3Jstack.txt
>
>
> Reported by [~vikram.dixit]
> When using multiple initializers, the following stack trace is seen at times.
> {code}
> 2014-09-17 15:05:00,406 ERROR [AsyncDispatcher event handler]
> org.apache.tez.dag.app.dag.impl.VertexImpl: Can't handle Invalid event
> V_ROOT_INPUT_INITIALIZED on vertex Map 2 with vertexId
> vertex_1410991351910_0002_8_01 at current state RUNNING
> org.apache.hadoop.yarn.state.InvalidStateTransitonException: Invalid event:
> V_ROOT_INPUT_INITIALIZED at RUNNING
> at
> org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:305)
> at
> org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46)
> at
> org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:448)
> at org.apache.tez.dag.app.dag.impl.VertexImpl.handle(VertexImpl.java:1337)
> at org.apache.tez.dag.app.dag.impl.VertexImpl.handle(VertexImpl.java:168)
> at
> org.apache.tez.dag.app.DAGAppMaster$VertexEventDispatcher.handle(DAGAppMaster.java:1641)
> at
> org.apache.tez.dag.app.DAGAppMaster$VertexEventDispatcher.handle(DAGAppMaster.java:1627)
> at
> org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:173)
> at
> org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:106)
> at java.lang.Thread.run(Thread.java:662)
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)