[ 
https://issues.apache.org/jira/browse/YARN-10754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18020218#comment-18020218
 ] 

ASF GitHub Bot commented on YARN-10754:
---------------------------------------

github-actions[bot] commented on PR #7297:
URL: https://github.com/apache/hadoop/pull/7297#issuecomment-3290090732

   We're closing this stale PR because it has been open for 100 days with no 
activity. This isn't a judgement on the merit of the PR in any way. It's just a 
way of keeping the PR queue manageable.
   If you feel like this was a mistake, or you would like to continue working 
on it, please feel free to re-open it and ask for a committer to remove the 
stale tag and review again.
   Thanks all for your contribution.




> RM Renew Delegation token thread should timeout and retry should also 
> consider app new submitted.
> -------------------------------------------------------------------------------------------------
>
>                 Key: YARN-10754
>                 URL: https://issues.apache.org/jira/browse/YARN-10754
>             Project: Hadoop YARN
>          Issue Type: Improvement
>            Reporter: Qi Zhu
>            Assignee: Qi Zhu
>            Priority: Major
>              Labels: pull-request-available
>         Attachments: YARN-10754.001.patch, image-2021-04-27-11-38-29-162.png
>
>
> As described in YARN-9768:
> The delegation token renewer thread in the RM (DelegationTokenRenewer.java) 
> renews the HDFS tokens it receives to check their validity and expiration time.
> This call is made to an underlying HDFS NN or Router node (which exposes the 
> same APIs as the HDFS NN). If one of the nodes is bad and the renew call 
> hangs, the thread remains stuck indefinitely. Ideally, the thread should time 
> out the renewToken call and retry from the client's perspective.
> However, that change only covers app recovery, not newly submitted apps:
> !image-2021-04-27-11-38-29-162.png|width=516,height=428!
> As a result, a newly submitted app is not retried when the token renewal 
> against the HDFS NameNode or Router times out.
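
For illustration only, the timeout-and-retry behaviour requested above could be sketched as follows. This is not the code in DelegationTokenRenewer.java or in the attached patch: the class name RenewWithTimeoutSketch, the helper renewWithRetry, and its parameters are hypothetical, and the renew call is simulated with a plain Callable. The point of the sketch is that the same bounded wait and retry wraps every renew attempt, whether the app is being recovered or was newly submitted.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.concurrent.atomic.AtomicInteger;

public class RenewWithTimeoutSketch {

  /**
   * Hypothetical helper (not a Hadoop API): runs renewCall with a per-attempt
   * timeout and retries when an attempt times out. The caller decides nothing
   * about app state, so recovered and newly submitted apps take the same path.
   */
  static long renewWithRetry(Callable<Long> renewCall, long timeoutMs, int maxAttempts)
      throws Exception {
    // Daemon threads so a hung renew call cannot keep the JVM alive.
    ExecutorService pool = Executors.newCachedThreadPool(r -> {
      Thread t = new Thread(r);
      t.setDaemon(true);
      return t;
    });
    try {
      for (int attempt = 1; attempt <= maxAttempts; attempt++) {
        Future<Long> future = pool.submit(renewCall);
        try {
          // Wait a bounded time instead of blocking indefinitely.
          return future.get(timeoutMs, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
          // Interrupt the stuck attempt, then fall through to the next one.
          future.cancel(true);
        }
      }
      throw new TimeoutException(
          "renew did not complete after " + maxAttempts + " attempts");
    } finally {
      pool.shutdownNow();
    }
  }

  public static void main(String[] args) throws Exception {
    AtomicInteger calls = new AtomicInteger();
    // Simulated renew call: the first attempt hangs, later attempts succeed.
    Callable<Long> flakyRenew = () -> {
      if (calls.incrementAndGet() == 1) {
        Thread.sleep(5_000); // stands in for a stuck NN/Router call
      }
      return 42L; // stands in for the new expiration time
    };
    long expiry = renewWithRetry(flakyRenew, 200, 3);
    System.out.println("expiry=" + expiry + " attempts=" + calls.get());
  }
}
```

Cancelling with future.cancel(true) only helps if the underlying call responds to interruption; a real RPC that ignores interrupts would still occupy a thread until it returns, which is why the sketch uses daemon threads.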



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org
