[ https://issues.apache.org/jira/browse/YARN-713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13726750#comment-13726750 ]
Omkar Vinit Joshi commented on YARN-713:
----------------------------------------

* Today we generate the ContainerToken and the NMToken separately. I think we will have to couple the two, i.e. if NMToken creation fails then there is no point in creating ContainerTokens for containers on that node manager.
* I see one problem in the existing patch which we should definitely address. During DNS hiccups we will keep generating new container IDs without actually using them. I don't know if this is a serious issue, but it will definitely be annoying for anyone trying to monitor an application's containers.
{code}
ContainerId containerId = BuilderUtils.newContainerId(application
    .getApplicationAttemptId(), application.getNewContainerId());
{code}
* Since the AMRMToken is also generated irrespective of security, we need to check whether it requires a similar fix.

> ResourceManager can exit unexpectedly if DNS is unavailable
> -----------------------------------------------------------
>
>                 Key: YARN-713
>                 URL: https://issues.apache.org/jira/browse/YARN-713
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.1.0-beta
>            Reporter: Jason Lowe
>            Assignee: Omkar Vinit Joshi
>            Priority: Critical
>             Fix For: 2.1.0-beta
>
>         Attachments: YARN-713.patch, YARN-713.patch, YARN-713.patch, YARN-713.patch
>
>
> As discussed in MAPREDUCE-5261, there's a possibility that a DNS outage could
> lead to an unhandled exception in the ResourceManager's AsyncDispatcher, and
> that ultimately would cause the RM to exit. The RM should not exit during
> DNS hiccups.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira