[ https://issues.apache.org/jira/browse/YARN-690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13660388#comment-13660388 ]
Vinod Kumar Vavilapalli commented on YARN-690:
----------------------------------------------

Bobby, the fix went in a little too fast for any of us to notice. Please give others a bit of time to look at it. Thanks.

While this is a quick fix that should help, we should think about longer-term solutions - specifically, catching the correct exception types. After our recent exception work, mainly YARN-628 and MAPREDUCE-5254, we can look for IOException specifically. Is that enough?

> RM exits on token cancel/renew problems
> ---------------------------------------
>
>                 Key: YARN-690
>                 URL: https://issues.apache.org/jira/browse/YARN-690
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 3.0.0, 0.23.7, 2.0.5-beta
>            Reporter: Daryn Sharp
>            Assignee: Daryn Sharp
>            Priority: Blocker
>             Fix For: 3.0.0, 2.0.5-beta, 0.23.8
>
>         Attachments: YARN-690.patch, YARN-690.patch
>
>
> The DelegationTokenRenewer thread is critical to the RM. When a
> non-IOException occurs, the thread calls System.exit to prevent the RM from
> running w/o the thread. It should be exiting only on non-RuntimeExceptions.
> The problem is especially bad in 23 because the yarn protobuf layer converts
> IOExceptions into UndeclaredThrowableExceptions (RuntimeException) which
> causes the renewer to abort the process. An UnknownHostException takes down
> the RM...
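
To make the "look for IOException specifically" idea concrete, here is a minimal sketch of how a caller might recognize an IOException even when it arrives wrapped in an UndeclaredThrowableException, as the issue describes for the 0.23 protobuf layer. This is not the committed YARN-690 patch: the class name RenewerFailureCheck and the methods unwrap and isIoFailure are hypothetical, and the sketch deliberately leaves open what the renewer should do with non-IO failures.

```java
import java.io.IOException;
import java.lang.reflect.UndeclaredThrowableException;
import java.net.UnknownHostException;

// Hypothetical helper, not part of Hadoop: classifies a failure seen by a
// renewer-style thread as "really an IOException" or not.
public class RenewerFailureCheck {

    // Peel off UndeclaredThrowableException layers to reach the real cause.
    static Throwable unwrap(Throwable t) {
        while (t instanceof UndeclaredThrowableException && t.getCause() != null) {
            t = t.getCause();
        }
        return t;
    }

    // True if the failure is an IOException, directly or wrapped.
    static boolean isIoFailure(Throwable t) {
        return unwrap(t) instanceof IOException;
    }

    public static void main(String[] args) {
        // An UnknownHostException is an IOException; here it is wrapped the
        // way the issue says the protobuf layer wraps it.
        Throwable wrapped =
            new UndeclaredThrowableException(new UnknownHostException("nn.example.com"));

        System.out.println(isIoFailure(wrapped));                          // true
        System.out.println(isIoFailure(new IOException("renew failed")));  // true
        System.out.println(isIoFailure(new IllegalStateException("bug"))); // false
    }
}
```

With a check like this, a wrapped UnknownHostException would be treated as an ordinary I/O failure instead of being mistaken for an unexpected RuntimeException.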