Yesha Vora created YARN-5098:
--------------------------------

             Summary: Yarn Application log Aggreagation fails due to NM can not 
get correct HDFS delegation token
                 Key: YARN-5098
                 URL: https://issues.apache.org/jira/browse/YARN-5098
             Project: Hadoop YARN
          Issue Type: Bug
          Components: yarn
            Reporter: Yesha Vora


Scenario:
* set dfs.namenode.delegation.token.max-lifetime=43200000 and 
dfs.namenode.delegation.token.renew-interval=28800000
* Start long running applications
* Let these applications run for ~3 days
* After 3 days, kill Kill the applications
* try to get application logs for above long running apps.

However, Yarn application logs for long running application could not be 
gathered because Nodemanager failed to talk to HDFS with below error.
{code}
2016-05-16 18:18:28,533 INFO  logaggregation.AppLogAggregatorImpl 
(AppLogAggregatorImpl.java:finishLogAggregation(555)) - Application just 
finished : application_1463170334122_0002
2016-05-16 18:18:28,545 WARN  ipc.Client (Client.java:run(705)) - Exception 
encountered while connecting to the server :
org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.security.token.SecretManager$InvalidToken):
 token (HDFS_DELEGATION_TOKEN token 171 for hrt_qa) can't be found in cache
        at 
org.apache.hadoop.security.SaslRpcClient.saslConnect(SaslRpcClient.java:375)
        at 
org.apache.hadoop.ipc.Client$Connection.setupSaslConnection(Client.java:583)
        at org.apache.hadoop.ipc.Client$Connection.access$1900(Client.java:398)
        at org.apache.hadoop.ipc.Client$Connection$2.run(Client.java:752)
        at org.apache.hadoop.ipc.Client$Connection$2.run(Client.java:748)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:422)
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1719)
        at 
org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:747)
        at org.apache.hadoop.ipc.Client$Connection.access$3100(Client.java:398)
        at org.apache.hadoop.ipc.Client.getConnection(Client.java:1597)
        at org.apache.hadoop.ipc.Client.call(Client.java:1439)
        at org.apache.hadoop.ipc.Client.call(Client.java:1386)
        at 
org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:240)
        at com.sun.proxy.$Proxy83.getServerDefaults(Unknown Source)
        at 
org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolTranslatorPB.getServerDefaults(ClientNamenodeProtocolTranslatorPB.java:282)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
        at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:498)
        at 
org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:256)
        at 
org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:104)
        at com.sun.proxy.$Proxy84.getServerDefaults(Unknown Source)
        at 
org.apache.hadoop.hdfs.DFSClient.getServerDefaults(DFSClient.java:1018)
        at org.apache.hadoop.fs.Hdfs.getServerDefaults(Hdfs.java:156)
        at 
org.apache.hadoop.fs.AbstractFileSystem.create(AbstractFileSystem.java:550)
        at org.apache.hadoop.fs.FileContext$3.next(FileContext.java:687)
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org

Reply via email to