[ 
https://issues.apache.org/jira/browse/TEZ-2163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14347539#comment-14347539
 ] 

Siddharth Seth commented on TEZ-2163:
-------------------------------------

Mostly looks good. One change is required in how the speculator event gets sent
out: RUNNING, which is currently sent in the START_WAIT to RUNNING transition,
needs to go out before the status update. The statusUpdate transition would need
to check whether RUNNING has been sent out yet and, if not, send it first.
This issue likely only affects branch 2003, but it's good to fix in any case.
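
A rough sketch of the ordering I have in mind, not taken from the patch. The
class, field, and method names here (AttemptSketch, runningNotified,
notifyRunningOnce, speculatorEvents) are hypothetical stand-ins rather than the
actual TaskAttemptImpl code, and keeping the attempt in START_WAIT after an
early status update is an assumption of the sketch:

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a task attempt; not the real TaskAttemptImpl.
class AttemptSketch {
    enum State { START_WAIT, RUNNING }

    private State state = State.START_WAIT;
    // Tracks whether RUNNING has already gone out to the speculator.
    private boolean runningNotified = false;
    // Records what the speculator would receive, in order.
    final List<String> speculatorEvents = new ArrayList<>();

    // Sends RUNNING at most once, whichever transition gets there first.
    private void notifyRunningOnce() {
        if (!runningNotified) {
            speculatorEvents.add("RUNNING");
            runningNotified = true;
        }
    }

    // START_WAIT -> RUNNING transition (the TA_STARTED_REMOTELY case).
    void onStartedRemotely() {
        notifyRunningOnce();
        state = State.RUNNING;
    }

    // Status update; may arrive while the attempt is still in START_WAIT.
    // Assumption: the state is left unchanged here.
    void onStatusUpdate() {
        notifyRunningOnce();
        speculatorEvents.add("STATUS_UPDATE");
    }

    State getState() {
        return state;
    }
}
```

With something along these lines, a status update that arrives first still
results in RUNNING followed by the update, and the later START_WAIT to RUNNING
transition does not send RUNNING a second time.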

> Task status update should be handled in the START_WAIT state
> ------------------------------------------------------------
>
>                 Key: TEZ-2163
>                 URL: https://issues.apache.org/jira/browse/TEZ-2163
>             Project: Apache Tez
>          Issue Type: Sub-task
>            Reporter: Siddharth Seth
>            Assignee: Jeff Zhang
>            Priority: Critical
>         Attachments: TEZ-2163-1.patch
>
>
> It's possible for a task to send in a STATUS_UPDATE before the 
> TA_STARTED_REMOTELY message is processed within the AM.
> {code}
> 2015-02-27 13:21:15,491 ERROR [Dispatcher thread: Central] impl.TaskAttemptImpl: Can't handle this event at current state for attempt_1424502260528_0177_5_03_000223_0
> org.apache.hadoop.yarn.state.InvalidStateTransitonException: Invalid event: TA_STATUS_UPDATE at START_WAIT
>   at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:305)
>   at org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46)
>   at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:448)
>   at org.apache.tez.dag.app.dag.impl.TaskAttemptImpl.handle(TaskAttemptImpl.java:670)
>   at org.apache.tez.dag.app.dag.impl.TaskAttemptImpl.handle(TaskAttemptImpl.java:112)
>   at org.apache.tez.dag.app.DAGAppMaster$TaskAttemptEventDispatcher.handle(DAGAppMaster.java:1835)
>   at org.apache.tez.dag.app.DAGAppMaster$TaskAttemptEventDispatcher.handle(DAGAppMaster.java:1820)
>   at org.apache.tez.common.AsyncDispatcher.dispatch(AsyncDispatcher.java:184)
>   at org.apache.tez.common.AsyncDispatcher$1.run(AsyncDispatcher.java:115)
>   at java.lang.Thread.run(Thread.java:745)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)