[
https://issues.apache.org/jira/browse/MAPREDUCE-4842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arun C Murthy updated MAPREDUCE-4842:
-------------------------------------
Attachment: MAPREDUCE-4842.patch
Jason, nice unit test! Thanks!
I've modified it a little to have 2 barriers (mergeStart and mergeComplete)
rather than use the same 4 times (confused me a lot when I was reviewing it).
Other than that, it looks great. +1
Also, if you don't mind, I'll assign the jira to you - since you've done all
the heavy lifting and deserve way more credit than I do. Thanks again!
> Shuffle race can hang reducer
> -----------------------------
>
> Key: MAPREDUCE-4842
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4842
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: mrv2
> Affects Versions: 2.0.2-alpha, 0.23.5
> Reporter: Jason Lowe
> Assignee: Arun C Murthy
> Priority: Blocker
> Attachments: MAPREDUCE-4842.patch, MAPREDUCE-4842.patch,
> MAPREDUCE-4842.patch
>
>
> Saw an instance where the shuffle caused multiple reducers in a job to hang.
> It looked similar to the problem described in MAPREDUCE-3721, where the
> fetchers were all being told to WAIT by the MergeManager but no merge was
> taking place.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira