[ https://issues.apache.org/jira/browse/MAPREDUCE-1499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12838131#action_12838131 ]
Matei Zaharia commented on MAPREDUCE-1499: ------------------------------------------ The problem in MAPREDUCE-1436 only happened with a backport of the trunk fair scheduler to 0.20. It probably won't happen with the 0.20 fair scheduler. However, anyone packaging a distribution that includes a backport of the fair scheduler should look into it. As I mentioned in MAPREDUCE-1436, the fix (adding a lock around the TaskTrackerManager in update() is simple and is unlikely to have a major impact on performance. > JobTracker.finalizeJob inverts lock order and causes potential deadlock > ----------------------------------------------------------------------- > > Key: MAPREDUCE-1499 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-1499 > Project: Hadoop Map/Reduce > Issue Type: Bug > Components: jobtracker > Affects Versions: 0.20.1, 0.20.2 > Reporter: Todd Lipcon > Assignee: Aaron Kimball > Priority: Critical > Attachments: jcarder_result_1.dot.png, jcarder_result_8.dot.png, > MAPREDUCE-1499.patch > > > This issue was brought up by Matei in MAPREDUCE-1436 as a fairsched bug, but > it turns out it's a JT bug even with the fifo scheduler in unpatched 0.20.2. > JobTracker.finalizeJob locks JT.jobs, JT.taskScheduler, etc, having gotten > the JIP log before the JT lock. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.