Re: reprocessing hanging tasks

Doug Cutting Mon, 10 Oct 2005 11:16:40 -0700

Stefan Groschupf wrote:

Do I miss the section in the jobtracker where this is done, or are people interested that I submit a patch doing this mechanism?

This is mostly already implemented. The tasktracker fails tasks that do not update their status within a configurable timeout. Task status is updated each time a task reads an input, writes an output or calls the Reporter.setStatus() method. The jobtracker will retry failed tasks up to four times.

The mapred-based fetcher also should not hang. It will exit even when it has hung threads. So the task timeout should be set to the maximum amount of time that any single page should require to fetch & parse. By default it is set to 10 minutes.
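
As a rough illustration of the mechanism described above (not the actual tasktracker or jobtracker code), the timeout-and-retry logic could be sketched as follows. The class TaskTimeoutSketch, its methods and the Outcome enum are all hypothetical names invented for this example. It also assumes that "retry up to four times" means four retries after the first attempt, and it folds the tasktracker and jobtracker roles into one object for brevity.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Illustrative sketch only: hypothetical names, not the Nutch classes.
 * A task is failed when it reports no progress within the timeout,
 * and a failed task is retried a limited number of times.
 */
final class TaskTimeoutSketch {

    /** Result of checking one task against the timeout. */
    enum Outcome { RUNNING, RETRY, GIVE_UP }

    private final long timeoutMillis; // e.g. 10 minutes = 600000 ms
    private final int maxRetries;     // assumed meaning: retries after the first attempt
    private final Map<String, Long> lastProgress = new HashMap<>();
    private final Map<String, Integer> retries = new HashMap<>();

    TaskTimeoutSketch(long timeoutMillis, int maxRetries) {
        this.timeoutMillis = timeoutMillis;
        this.maxRetries = maxRetries;
    }

    /** Registers the first attempt of a task at time {@code now}. */
    void start(String taskId, long now) {
        lastProgress.put(taskId, now);
        retries.put(taskId, 0);
    }

    /** Stands in for "read an input, wrote an output or set the status". */
    void progress(String taskId, long now) {
        lastProgress.put(taskId, now);
    }

    /** Fails the attempt if it has been silent for longer than the timeout. */
    Outcome check(String taskId, long now) {
        Long last = lastProgress.get(taskId);
        if (last == null) {
            throw new IllegalArgumentException("unknown task: " + taskId);
        }
        if (now - last <= timeoutMillis) {
            return Outcome.RUNNING;
        }
        int used = retries.get(taskId);
        if (used < maxRetries) {
            retries.put(taskId, used + 1);
            lastProgress.put(taskId, now); // the retried attempt starts its clock now
            return Outcome.RETRY;
        }
        return Outcome.GIVE_UP;
    }

    public static void main(String[] args) {
        TaskTimeoutSketch tracker = new TaskTimeoutSketch(10 * 60 * 1000L, 4);
        tracker.start("task_1", 0L);
        // Five minutes of silence is still within the ten-minute timeout.
        System.out.println(tracker.check("task_1", 5 * 60 * 1000L));
        // Eleven minutes of silence exceeds it, so the attempt is failed.
        System.out.println(tracker.check("task_1", 11 * 60 * 1000L));
    }
}
```

In this sketch a fetcher that calls progress() at least once per page never times out, provided the timeout is at least as long as the slowest acceptable fetch and parse.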


Doug
