[ https://issues.apache.org/jira/browse/YARN-2313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071816#comment-14071816 ]
Hudson commented on YARN-2313: ------------------------------ SUCCESS: Integrated in Hadoop-Mapreduce-trunk #1840 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1840/]) YARN-2313. Livelock can occur in FairScheduler when there are lots of running apps (Tsuyoshi Ozawa via Sandy Ryza) (sandy: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1612769) * /hadoop/common/trunk/hadoop-yarn-project/CHANGES.txt * /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/dev-support/findbugs-exclude.xml * /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fair/FairScheduler.java * /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fair/FairSchedulerConfiguration.java * /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fair/TestFairSchedulerPreemption.java * /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/FairScheduler.apt.vm > Livelock can occur in FairScheduler when there are lots of running apps > ----------------------------------------------------------------------- > > Key: YARN-2313 > URL: https://issues.apache.org/jira/browse/YARN-2313 > Project: Hadoop YARN > Issue Type: Bug > Components: fairscheduler > Affects Versions: 2.4.1 > Reporter: Tsuyoshi OZAWA > Assignee: Tsuyoshi OZAWA > Fix For: 2.6.0 > > Attachments: YARN-2313.1.patch, YARN-2313.2.patch, YARN-2313.3.patch, > YARN-2313.4.patch, rm-stack-trace.txt > > > Observed livelock on FairScheduler when there are lots entry in queue. After > my investigating code, following case can occur: > 1. {{update()}} called by UpdateThread takes longer times than > UPDATE_INTERVAL(500ms) if there are lots queue. > 2. UpdateThread goes busy loop. > 3. Other threads(AllocationFileReloader, > ResourceManager$SchedulerEventDispatcher) can wait forever. -- This message was sent by Atlassian JIRA (v6.2#6252)