[ 
https://issues.apache.org/jira/browse/YARN-674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13805494#comment-13805494
 ] 

Zhijie Shen commented on YARN-674:
----------------------------------

I've a quick look at the patch. Here're my comments:

1. It seem that the change in RMAppManager is not necessary, because the 
current logic is to reject the app in the secure case when parsing the 
credentials and adding the apps to DelegationTokenRenewer have something wrong; 
otherwise, the app will be accepted. Though there's no obvious "if... else..." 
structure, it achieves the same logic control via:
{code}
      throw RPCUtil.getRemoteException(ie);
{code}
I think the exception needs to be thrown, which is missing in your patch. The 
exception will notice the client that the app submission fails; otherwise, the 
client will think the submission succeeds?

If I miss some ideas here, please let me know.

2. Since DelegationTokenRenewer#addApplication becomes asynchronous, what will 
the impact of that the application is already accepted and starts its life 
cycle, while DelegationTokenRenewer is so slow to 
DelegationTokenRenewerAppSubmitEvent. Will the application fail somewhere else 
due to the fresh token unavailable?

3. I noticed testConncurrentAddApplication has been removed. Does the change 
affect the current app submission?

> Slow or failing DelegationToken renewals on submission itself make RM 
> unavailable
> ---------------------------------------------------------------------------------
>
>                 Key: YARN-674
>                 URL: https://issues.apache.org/jira/browse/YARN-674
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: resourcemanager
>            Reporter: Vinod Kumar Vavilapalli
>            Assignee: Omkar Vinit Joshi
>         Attachments: YARN-674.1.patch
>
>
> This was caused by YARN-280. A slow or a down NameNode for will make it look 
> like RM is unavailable as it may run out of RPC handlers due to blocked 
> client submissions.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to