Till Rohrmann created FLINK-1925:
------------------------------------

             Summary: Split SubmitTask method up into two phases: Receive TDD 
and instantiation of TDD
                 Key: FLINK-1925
                 URL: https://issues.apache.org/jira/browse/FLINK-1925
             Project: Flink
          Issue Type: Improvement
            Reporter: Till Rohrmann
            Assignee: Till Rohrmann


ResearchGate reported that a job times out while submitting tasks to the 
TaskManager. The reason is that the JobManager expects a TaskOperationResult 
response upon submitting a task to the TM. The TM downloads then the required 
jars from the JM which blocks the actor thread and can take a very long time if 
many TMs download from the JM. Due to this, the SubmitTask future throws a 
TimeOutException.

A possible solution could be that the TM eagerly acknowledges the reception of 
the SubmitTask message and executes the task initialization within a future. 
The future will upon completion send a UpdateTaskExecutionState message to the 
JM which switches the state of the task from deploying to running. This means 
that the handler of SubmitTask future in {{Execution}} won't change the state 
of the task.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to