[ 
https://issues.apache.org/jira/browse/YARN-713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13726750#comment-13726750
 ] 

Omkar Vinit Joshi commented on YARN-713:
----------------------------------------

* Today we are generating ContainerToken and NMToken separately. I guess we 
will have to couple both the things together... i.e. if nmtoken creation fails 
then there is no point in creating containerTokens for containers present on 
that node manager.
* I see one problem in the existing patch which we definitely should address. 
In case of DNS hiccups we will keep generating new container Ids without 
actually using them.. I don't know if this is a serious issue but will 
definitely be annoying for someone trying to monitor application container.
{code}
        ContainerId containerId = BuilderUtils.newContainerId(application
            .getApplicationAttemptId(), application.getNewContainerId());
{code}
* As AMRMToken also is done irrespective of security need to check if that also 
requires similar fixes.
                
> ResourceManager can exit unexpectedly if DNS is unavailable
> -----------------------------------------------------------
>
>                 Key: YARN-713
>                 URL: https://issues.apache.org/jira/browse/YARN-713
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.1.0-beta
>            Reporter: Jason Lowe
>            Assignee: Omkar Vinit Joshi
>            Priority: Critical
>             Fix For: 2.1.0-beta
>
>         Attachments: YARN-713.patch, YARN-713.patch, YARN-713.patch, 
> YARN-713.patch
>
>
> As discussed in MAPREDUCE-5261, there's a possibility that a DNS outage could 
> lead to an unhandled exception in the ResourceManager's AsyncDispatcher, and 
> that ultimately would cause the RM to exit.  The RM should not exit during 
> DNS hiccups.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to