FSError encountered by one running task should not be fatal to other tasks on
that node
---------------------------------------------------------------------------------------
Key: HADOOP-1324
URL: https://issues.apache.org/jira/browse/HADOOP-1324
Project: Hadoop
Issue Type: Improvement
Components: mapred
Reporter: Devaraj Das
Currently, if one task encounters an FSError, it reports the error to the
TaskTracker, and the TaskTracker reinitializes itself, effectively losing the
state of all the other running tasks as well. This can probably be improved,
especially after the fix for HADOOP-1252. The TaskTracker should probably avoid
reinitializing itself and instead get blacklisted for that job. Other tasks
should be allowed to continue for as long as they can (that is, until they
complete successfully or fail, whether due to disk problems or otherwise).
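
The proposed behavior could be sketched roughly as follows. This is only an
illustration under assumptions, not the actual TaskTracker code: the class
FsErrorIsolationSketch, its methods, and the TaskState enum are all
hypothetical names invented for this example. It shows an FSError failing only
the reporting task and blacklisting the node for that task's job, while every
other task keeps its state.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical model of a per-node task tracker; none of these names
// come from the Hadoop code base.
final class FsErrorIsolationSketch {
    enum TaskState { RUNNING, SUCCEEDED, FAILED }

    // taskId -> current state, for every task launched on this node
    private final Map<String, TaskState> tasks = new HashMap<>();
    // taskId -> id of the job the task belongs to
    private final Map<String, String> jobOfTask = new HashMap<>();
    // jobs for which this node should accept no further tasks
    private final Set<String> blacklistedJobs = new HashSet<>();

    void launch(String jobId, String taskId) {
        if (blacklistedJobs.contains(jobId)) {
            throw new IllegalStateException("node is blacklisted for " + jobId);
        }
        tasks.put(taskId, TaskState.RUNNING);
        jobOfTask.put(taskId, jobId);
    }

    // Called when a task reports an FSError. Instead of reinitializing
    // (which would drop the state of every task), fail only this task
    // and blacklist the node for the task's job.
    void onFsError(String taskId) {
        String jobId = jobOfTask.get(taskId);
        if (jobId == null) {
            return; // unknown task: nothing to do
        }
        tasks.put(taskId, TaskState.FAILED);
        blacklistedJobs.add(jobId);
    }

    boolean isBlacklistedFor(String jobId) {
        return blacklistedJobs.contains(jobId);
    }

    TaskState stateOf(String taskId) {
        return tasks.get(taskId);
    }
}
```

In this sketch, tasks that are already running are left untouched, including
other tasks of the same job, so they can still complete or fail on their own.
Only new launches for the blacklisted job are refused.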