Optimize finding of speculative tasks
-------------------------------------

                 Key: HADOOP-2874
                 URL: https://issues.apache.org/jira/browse/HADOOP-2874
             Project: Hadoop Core
          Issue Type: Improvement
          Components: mapred
            Reporter: Amar Kamat


Assuming HADOOP-2119 provides better data structures for handling running TIPs, 
finding new speculative tasks can be further optimized. Two of which could be 
1) {{conf.getMapSpeculativeExecution()}} and 
{{conf.getReduceSpeculativeExecution()}} should be moved to {{JobInProgress}}. 
A simple check for this boolean can prove useful before checking for 
speculative tasks. This will be useful for jobs with large maps and reducers 
where scanning all the TIPs  can be costly. 
2) Since the progress of a TIP changes only when 
{{TaskInProgress.recomputeProgress()}} is invoked, it makes more sense to check 
for speculation in {{JobInProgress.updateTaskStatus()}} and move the TIPs that 
can be speculated to the front of the running queue.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to