[
https://issues.apache.org/jira/browse/MAPREDUCE-4785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Haibo Chen updated MAPREDUCE-4785:
----------------------------------
Attachment: mapreduce4785.prelim.patch
The cause of this problem is that the update of task/taskAttempt states is not
in sync with that of TaskAttemptCompletionEvent of the job due to the async
nature of the dispatcher dispatching the events. If the async dispatcher is
delayed long enough, we will have cases where the task/taskAttempt states have
changed, but the TaskAttemptCompletionEvent[] is still stale. We could wait
for a maximum amount of time for the update of TaskAttemptCompletionEvents.
Only in extreme cases when the update is delayed longer than our wait time,
will this test fail and cause a false alarm. Please see my preliminary patch.
> TestMRApp occasionally fails
> ----------------------------
>
> Key: MAPREDUCE-4785
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4785
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: mrv2, test
> Affects Versions: 2.0.3-alpha
> Reporter: Jason Lowe
> Assignee: Haibo Chen
> Attachments: mapreduce4785.prelim.patch
>
>
> TestMRApp is failing occasionally with this error:
> {noformat}
> testUpdatedNodes(org.apache.hadoop.mapreduce.v2.app.TestMRApp): Expecting 2
> more completion events for killed expected:<4> but was:<2>
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)