[
https://issues.apache.org/jira/browse/MAPREDUCE-508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Allen Wittenauer resolved MAPREDUCE-508.
----------------------------------------
Resolution: Fixed
This has been essentially fixed already but I'm too lazy to look up the jira.
> dynamic heartbeat interval for the locality-aware task scheduling
> -----------------------------------------------------------------
>
> Key: MAPREDUCE-508
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-508
> Project: Hadoop Map/Reduce
> Issue Type: Wish
> Reporter: Leitao Guo
> Priority: Minor
>
> In the current Hadoop release (0.17.0), there is no special scheduling policy
> for tasktrackers that have no local data for a job, which is inefficient in
> some scenarios. For example, tasktracker A has the data for a job, but
> tasktracker B, which has no data for this job, sends its heartbeat message
> to the jobtracker asking for a new task before tasktracker A does. The task
> may then be scheduled to B instead of A, and the jobtracker has to find a
> different task for tasktracker A when A asks for one.
> In this situation, a reservation policy in the jobtracker, such as reserving
> the task for tasktracker A and letting B ask again in its next heartbeat
> message, would be more efficient: before tasktracker B asks for a new task
> the second time, tasktracker A will have asked for a task and the
> jobtracker will have scheduled the task to A.
> Here is a rough idea to deal with the scenario above:
> (1) The jobtracker receives the heartbeat message sent by tasktracker B,
> which has no data for any job.
> (2) The jobtracker sends a response to tasktracker B with a new heartbeat
> interval, but does not schedule a new task to B. The new heartbeat
> interval should be shorter than the current heartbeat interval, for example,
> current_heartbeat_interval/2.
> (3) Tasktracker B receives the response from the jobtracker, and sends
> another heartbeat message for a new task after a period of
> current_heartbeat_interval/2.
> (4) The jobtracker then finds a new task for tasktracker B.
> This is just a preliminary idea for improving locality-aware
> scheduling. Any comments are welcome.
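
For illustration, steps (1) to (4) of the proposal can be sketched as a toy simulation. This is only a rough sketch under stated assumptions, not Hadoop code: the `Task` and `JobTracker` classes, the `heartbeat` method, the 10-second base interval, and the rule that a tracker is deferred at most once are all hypothetical choices made here to keep the example small.

```python
from dataclasses import dataclass, field


@dataclass
class Task:
    name: str
    local_hosts: frozenset  # trackers that hold this task's input data


@dataclass
class JobTracker:
    """Toy model of the proposal; not the real Hadoop JobTracker."""
    pending: list
    base_interval: float = 10.0  # seconds; illustrative value only
    deferred: set = field(default_factory=set)  # trackers already told to retry

    def heartbeat(self, tracker):
        """Return (task_or_None, next_heartbeat_interval) for this tracker."""
        # Prefer a task whose input data lives on the calling tracker.
        for task in self.pending:
            if tracker in task.local_hosts:
                self.pending.remove(task)
                self.deferred.discard(tracker)
                return task, self.base_interval
        # Steps (1)-(2): no local task, so assign nothing and halve the interval.
        if self.pending and tracker not in self.deferred:
            self.deferred.add(tracker)
            return None, self.base_interval / 2
        # Step (4): second request, so hand out a non-local task if any remain.
        self.deferred.discard(tracker)
        task = self.pending.pop(0) if self.pending else None
        return task, self.base_interval


jt = JobTracker(pending=[Task("t1", frozenset({"A"})),
                         Task("t2", frozenset({"C"}))])
print(jt.heartbeat("B"))  # B has no local data: no task, shorter interval
print(jt.heartbeat("A"))  # A reports in the meantime and gets its local task
print(jt.heartbeat("B"))  # step (3)-(4): B retries and gets a remaining task
```

In this sketch, deferring B for half an interval gives A a chance to claim its data-local task first; B is never deferred twice in a row, so it cannot be starved.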