[
https://issues.apache.org/jira/browse/YARN-6061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15844002#comment-15844002
]
Yufei Gu commented on YARN-6061:
--------------------------------
Digging through YARN-2579, it is not a good idea to do transitionToStandby in
RMFatalEventDispatcher because of following call sequences:
{noformat}
[eventHandlingThread]: RMFatalEventDispatcher handler => transitionToStandby()
=> ResetDispatch() => serviceStop() => eventHandlingThread.join().
{noformat}
Instead we could reuse the StandByTransitionThread in the class RMStateStore to
do transition to standby.
> Add a customized uncaughtexceptionhandler for critical threads in RM
> --------------------------------------------------------------------
>
> Key: YARN-6061
> URL: https://issues.apache.org/jira/browse/YARN-6061
> Project: Hadoop YARN
> Issue Type: Improvement
> Components: resourcemanager
> Reporter: Yufei Gu
> Assignee: Yufei Gu
> Attachments: YARN-6061.001.patch, YARN-6061.002.patch
>
>
> There are several threads in fair scheduler. The thread will quit when there
> is a runtime exception inside it. We should bring down the RM when that
> happens. Otherwise, there may be some weird behavior in RM.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]