[ 
https://issues.apache.org/jira/browse/YARN-6061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15844002#comment-15844002
 ] 

Yufei Gu commented on YARN-6061:
--------------------------------

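After digging through YARN-2579, I don't think it is a good idea to call 
transitionToStandby() in RMFatalEventDispatcher, because of the following call 
sequence, in which the event handling thread ends up joining itself and so 
never returns: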
{noformat}
[eventHandlingThread]: RMFatalEventDispatcher handler => transitionToStandby() 
=> resetDispatcher() => serviceStop() => eventHandlingThread.join()
{noformat}
Instead, we could reuse the StandByTransitionThread in the RMStateStore class 
to perform the transition to standby, so that it runs outside the event 
handling thread.
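
To illustrate the concern, here is a minimal standalone sketch in plain Java. 
It is not Hadoop code: the class and method names (StandbyHandoffSketch, 
selfJoinBlocks, handoffCompletes) are made up for illustration, and the thread 
names only mirror the roles discussed above. The first method shows that a 
thread calling join() on itself stays blocked; the second shows the hand-off 
idea, where a separate thread waits for the handler thread instead.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

// Hypothetical demo class; none of these names exist in Hadoop.
class StandbyHandoffSketch {

    /** Returns true if a thread that joins itself is still blocked after timeoutMs. */
    static boolean selfJoinBlocks(long timeoutMs) throws InterruptedException {
        Thread handler = new Thread(() -> {
            try {
                // Stand-in for the stop path: the handler thread waits for itself.
                Thread.currentThread().join();
            } catch (InterruptedException e) {
                // Interrupted by the caller below so that the demo can finish.
            }
        }, "eventHandlingThread");
        handler.start();
        handler.join(timeoutMs);              // bounded wait from the outside
        boolean stillBlocked = handler.isAlive();
        handler.interrupt();                  // release the stuck thread
        handler.join();
        return stillBlocked;
    }

    /** Returns true if a separate thread can join the handler within timeoutMs. */
    static boolean handoffCompletes(long timeoutMs) throws InterruptedException {
        CountDownLatch transitionDone = new CountDownLatch(1);
        Thread handler = new Thread(() -> {
            final Thread self = Thread.currentThread();
            // Hand the stop/transition work to another thread and return at once.
            Thread transition = new Thread(() -> {
                try {
                    self.join();              // safe: joins a different thread
                    transitionDone.countDown();
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            }, "standbyTransitionThread");
            transition.start();
        }, "eventHandlingThread");
        handler.start();
        return transitionDone.await(timeoutMs, TimeUnit.MILLISECONDS);
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("self-join still blocked: " + selfJoinBlocks(200));
        System.out.println("hand-off completed: " + handoffCompletes(2000));
    }
}
```

The sketch only models the threading pattern; whether the existing 
StandByTransitionThread can be reused as-is for fatal events would still need 
to be checked against the RM code.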

> Add a customized uncaughtexceptionhandler for critical threads in RM
> --------------------------------------------------------------------
>
>                 Key: YARN-6061
>                 URL: https://issues.apache.org/jira/browse/YARN-6061
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: resourcemanager
>            Reporter: Yufei Gu
>            Assignee: Yufei Gu
>         Attachments: YARN-6061.001.patch, YARN-6061.002.patch
>
>
> There are several threads in the fair scheduler. A thread quits when a 
> runtime exception is thrown inside it. We should bring down the RM when that 
> happens; otherwise, the RM may behave unpredictably. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
