[ 
https://issues.apache.org/jira/browse/MAPREDUCE-508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-508.
----------------------------------------

    Resolution: Fixed

This has been essentially fixed already but I'm too lazy to look up the jira.

> dynamic heartbeat interval for the locality-aware task scheduling
> -----------------------------------------------------------------
>
>                 Key: MAPREDUCE-508
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-508
>             Project: Hadoop Map/Reduce
>          Issue Type: Wish
>            Reporter: Leitao Guo
>            Priority: Minor
>
> In current hadoop release (0.17.0), there is no special scheduling policy for 
> those tasktrackers who have no data for some jobs. So, there would be 
> inefficient in some senarios. For example, tasktracker A has the data for a 
> job, but tasktracker B, which has no data for this job, sends the heartbeat 
> message to the jobtracker for a new task before tasktrack A. The task may be 
> scheduled to B instead of A. While Jobtracker has to find a new task for 
> tasktracker A when A ask for a new task. 
> In this situation, if jobtracker has some reservation policy, such as reserve 
> the task for tasktracker A and let B ask for new task in the next heartbeat 
> message, that would be more efficient. Because before tasktracker B asking 
> for new task the second time, tasktracker A has applied for a new task and 
> jobtracker has scheduled the task to A.
> Here is a rough idea to deal with the senario above:
> (1) Jobtracker receives the heartbeat message sent by tasktracker B, which 
> has no data for any job.
> (2) Jobtracker send response message to tasktracker B with a new heartbeat 
> message interval, but does not schedule new task to B.  The new heartbeat 
> interval should be shorter the current heartbeat interval, for example, 
> current_heartbeat_interval/2.
> (3) Tasktracker B receive the response from jobtracker, and sends another 
> heartbeat message for a new task after a period of 
> current_heartbeat_interval/2 .
> (4) Jobtracker then find a new task for tasktracker B.
> This is just an primary idea for the improvement of the locality-aware 
> scheduling. Any comments are welcome.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to