[ https://issues.apache.org/jira/browse/MAPREDUCE-2846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13085880#comment-13085880 ]
Allen Wittenauer commented on MAPREDUCE-2846:
---------------------------------------------
*nods* I'm mostly convinced it is a race condition in MR-2415. I haven't had
enough time to start playing in the source to track it down more. I did talk
to Owen about it already, but thought it might be useful to at least get the JIRA
filed to put more eyes on it since race conditions are usually pretty awful to
track down.
> approx 10% of all tasks fail with DefaultTaskController
> -------------------------------------------------------
>
> Key: MAPREDUCE-2846
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2846
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: task, task-controller, tasktracker
> Affects Versions: 0.20.204.0
> Reporter: Allen Wittenauer
> Priority: Blocker
>
> After upgrading our test 0.20.203 grid to 0.20.204-rc2, we ran terasort to
> verify operation. While the job completed successfully, approx 10% of the
> tasks failed with task runner execution errors and the inability to create
> symlinks for attempt logs.