[ https://issues.apache.org/jira/browse/YARN-10058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17002713#comment-17002713 ]
tuyu commented on YARN-10058: ----------------------------- when patch YARN-8737 to local repo, this can not fix race condition, the async thread also crash,and capacity scheduler will hang. I think no matter what happens,if global scheduler's async thread crash, current RM should exit or transition to standby. > Capacity Scheduler dispatcher hang when async thread crash > ---------------------------------------------------------- > > Key: YARN-10058 > URL: https://issues.apache.org/jira/browse/YARN-10058 > Project: Hadoop YARN > Issue Type: Bug > Components: capacity scheduler > Affects Versions: 3.2.0, 3.2.1 > Reporter: tuyu > Priority: Major > Fix For: 3.2.1 > > Attachments: 0001-global-scheduling-standby-hang.patch > > > when capacity scheduler enable global scheduler, if global scheduler's > AsyncScheduleThread crash, the capacity scheduler dispatcher will hang for > long time. This behavior is unreasonable. > if this situation happen, In HA mode, current RM should change to standby -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org