Doug,
I have definitely run into this problem several times: tasktrackers were
still sending heartbeat messages but were no longer processing the job.
For example, no new pages were being fetched, and the pages/sec.
statistic dropped lower and lower.
I personally think it would make more sense for the jobtracker to
decide whether a task has exceeded the average processing time and
needs to be re-executed.
The last section of the Google paper covers this issue; they report
performance improvements from re-executing tasks that run over a
specific time.
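Roughly the kind of check I have in mind, as a sketch (all of the names
below are hypothetical, nothing of this exists in the code base):

import java.util.List;

public class StragglerCheck {

  // Hypothetical view of a running task.
  public interface RunningTask {
    long elapsedMillis();
    boolean isComplete();
  }

  // Returns true if a still-running task has taken much longer than
  // the average of its completed siblings (e.g. a 20x slower machine).
  public static boolean shouldReexecute(RunningTask task,
                                        List<Long> completedDurations,
                                        double slowFactor) {
    if (completedDurations.isEmpty()) {
      return false;                 // nothing to compare against yet
    }
    long sum = 0;
    for (long d : completedDurations) {
      sum += d;
    }
    double average = (double) sum / completedDurations.size();
    return !task.isComplete()
        && task.elapsedMillis() > slowFactor * average;
  }
}

The slow factor could of course be a per-job configuration value.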
Maybe we are misunderstanding each other: I do not mean tasks that
crash, I mean tasks that run 20 times slower on one machine than the
same tasks do on the other machines.
Stefan
On 10.10.2005, at 20:16, Doug Cutting wrote:
Stefan Groschupf wrote:
Am I missing the section in the jobtracker where this is done, or
would people be interested in a patch implementing this mechanism?
This is mostly already implemented. The tasktracker fails tasks
that do not update their status within a configurable timeout.
Task status is updated each time a task reads an input, writes an
output or calls the Reporter.setStatus() method. The jobtracker
will retry failed tasks up to four times.
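For example, a map implementation can keep its status fresh on every
record, along these lines (a sketch only: the package names and map()
signature are from memory, and fetchPage() is a stand-in):

import java.io.IOException;

import org.apache.nutch.io.Writable;
import org.apache.nutch.io.WritableComparable;
import org.apache.nutch.mapred.OutputCollector;
import org.apache.nutch.mapred.Reporter;

// Sketch of a map implementation that updates its status on every
// record, so the tasktracker timeout only fires when a task is truly
// stuck. Mapper boilerplate (configure/close) is omitted.
public class StatusReportingMap {

  public void map(WritableComparable key, Writable value,
                  OutputCollector output, Reporter reporter)
      throws IOException {
    Writable page = fetchPage(key);        // the slow part: fetch & parse
    output.collect(key, page);             // writing output updates status
    reporter.setStatus("fetched " + key);  // explicit status update
  }

  private Writable fetchPage(WritableComparable key) throws IOException {
    return null;                           // stand-in for real fetch code
  }
}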
The mapred-based fetcher also should not hang. It will exit even
when it has hung threads. So the task timeout should be set to the
maximum amount of time that any single page should require to fetch
& parse. By default it is set to 10 minutes.
Doug
---------------------------------------------------------------
company: http://www.media-style.com
forum: http://www.text-mining.org
blog: http://www.find23.net