[ 
https://issues.apache.org/jira/browse/TEZ-14?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14167844#comment-14167844
 ] 

Bikas Saha commented on TEZ-14:
-------------------------------

I have attached an initial port of the MR default speculator to Tez

Overall notes
1) Ported the DefaultSpeculator and estimator code to Tez and made some changes 
that made sense for Tez. Eg. remove container need tracking for M and R tasks 
because Tez has generic priority based ordering and scheduling. [~jlowe] 
[~jeagles] Could you please take a close look at this code.
2) Speculator runs per vertex and gets status updates from task attempts. 
3) State machines already had support for speculation and needed only minor bug 
fixes. [~jlowe][~jeagles] Do you know of any state machine fixes that were made 
in MR for speculation recently?

Improved  MockDAGAppMaster infra to add some basic test cases for useful and 
non-useful speculation. 

The speculator is not pluggable etc because this is mainly targeted to match 
legacy MR speculation. A more advanced speculator would need much more 
information and stats that are currently not readily available.

TBD - more tests, running speculative tasks at lower priority than normal tasks 
so that they dont block normal tasks of the same vertex,   counters/ATS-events, 
performance measurements/heuristics to check if this is safe to run on the 
AsyncDispatcher thread. [~jlowe][~jeagles] Do you have some idea of the 
performance of this code in Yahoo clusters?

[~hitesh] [~jeagles][~jlowe] Please review.



> Support for speculation of slow tasks
> -------------------------------------
>
>                 Key: TEZ-14
>                 URL: https://issues.apache.org/jira/browse/TEZ-14
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Bikas Saha
>            Assignee: Bikas Saha
>         Attachments: TEZ-14.1.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to