reprocessing hanging tasks

Stefan Groschupf Mon, 10 Oct 2005 06:32:59 -0700

Hi,
I tried to understand the jobtracker code.

Hmm more than 1000 lines of code in just one class. :-( This makesunderstanding code very difficult.

Anyway I'm missing a mechanism to reprocess hanging tasks. May I justdidn't find the code, but I invest some time to find it.As the google paper describe the original map reduce reprocess tasksthat may still run but are much slower than the other tasks becauseof some hardware failures.Since I notice that task-tracker isn't that stabile yet, I wouldreally love to have such a reprocessing mechanism.Actually I seen tasks are reprocessed in case the task-tracker crashand does not return any reports anymore or the task-tracker report atask failure.But for example in case the network speed of a fetching mapping taskis very very slow the job itself needs for ever.

I would suggest add start time and finishing time to the task objectand set these values until status changes.We can calculate a average time a task need for processing based onthis values.Than we have a configurable value of minimal finished tasks before westart to reprocessing tasks. For example 80% tasks need to be ready.Further more we have a configurable values threshold, in case theprocessing time of a task is treshold * average processing time, wejust reprocessing the task on a other tasktracker.


What do people think?

Do I miss the section in the jobtracker where this is done, or arepeople interested that I submit a patch doing this mechanism?

Stefan

reprocessing hanging tasks

Reply via email to