[ 
https://issues.apache.org/jira/browse/TEZ-2035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306215#comment-14306215
 ] 

Jonathan Eagles commented on TEZ-2035:
--------------------------------------

A sister jira is YARN-3144.

Corresponding stack trace:
{code}
java.lang.RuntimeException: Failed to connect to timeline server. Connection retries limit exceeded. The posted timeline event may be missing
        at org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl$TimelineClientConnectionRetry.retryOn(TimelineClientImpl.java:197)
        at org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl$TimelineJerseyRetryFilter.handle(TimelineClientImpl.java:236)
        at com.sun.jersey.api.client.Client.handle(Client.java:648)
        at com.sun.jersey.api.client.WebResource.handle(WebResource.java:670)
        at com.sun.jersey.api.client.WebResource.access$200(WebResource.java:74)
        at com.sun.jersey.api.client.WebResource$Builder.put(WebResource.java:533)
        at org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl.doPostingObject(TimelineClientImpl.java:456)
        at org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl.doPosting(TimelineClientImpl.java:301)
        at org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl.putDomain(TimelineClientImpl.java:295)
        at org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager.createTimelineDomain(ATSHistoryACLPolicyManager.java:113)
        at org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager.createSessionDomain(ATSHistoryACLPolicyManager.java:148)
        at org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager.setupNonSessionACLs(ATSHistoryACLPolicyManager.java:215)
        at org.apache.tez.client.TezClientUtils.createApplicationSubmissionContext(TezClientUtils.java:536)
        at org.apache.tez.client.TezClient.submitDAGApplication(TezClient.java:726)
        at org.apache.tez.client.TezClient.submitDAGApplication(TezClient.java:703)
        at org.apache.tez.client.TezClient.submitDAG(TezClient.java:383)
        at org.apache.tez.mapreduce.examples.MRRSleepJob.run(MRRSleepJob.java:751)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
        at org.apache.tez.mapreduce.examples.MRRSleepJob.main(MRRSleepJob.java:399)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:601)
        at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:71)
        at org.apache.hadoop.util.ProgramDriver.run(ProgramDriver.java:144)
        at org.apache.tez.mapreduce.examples.ExampleDriver.main(ExampleDriver.java:85)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:601)
        at org.apache.hadoop.util.RunJar.run(RunJar.java:221)
        at org.apache.hadoop.util.RunJar.main(RunJar.java:136)
{code}

> Make timeline server putDomain exceptions non-fatal
> ---------------------------------------------------
>
>                 Key: TEZ-2035
>                 URL: https://issues.apache.org/jira/browse/TEZ-2035
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Jonathan Eagles
>            Assignee: Jonathan Eagles
>
> Timeline server operations have so far been non-fatal. However, the
> putDomain operation is fatal and causes the job to fail at submission. This
> patch is a simple fix that makes the exception non-fatal. A likely follow-on
> effort could disable timeline server operations for the AM if putDomain
> fails.
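
For illustration only, a minimal sketch of what "non-fatal" could look like on the client side. This is not the actual patch: NonFatalDomainSetup, DomainPublisher and tryCreateDomain are hypothetical names that stand in for the real Tez and YARN classes seen in the stack trace, and the sketch assumes it is acceptable to submit the job without a timeline domain when the timeline server cannot be reached.

```java
import java.util.Optional;
import java.util.logging.Level;
import java.util.logging.Logger;

/** Hypothetical sketch; none of these names come from Tez or YARN. */
final class NonFatalDomainSetup {
    private static final Logger LOG =
        Logger.getLogger(NonFatalDomainSetup.class.getName());

    /** Stand-in (assumption) for the timeline client call that registers a domain. */
    interface DomainPublisher {
        void putDomain(String domainId) throws Exception;
    }

    /**
     * Attempts to create the domain. Any failure, including a
     * RuntimeException such as the one in the stack trace, is logged and
     * swallowed so that the caller can continue with job submission.
     *
     * @return the domain id on success, or empty if the domain was not created
     */
    static Optional<String> tryCreateDomain(DomainPublisher publisher,
                                            String domainId) {
        try {
            publisher.putDomain(domainId);
            return Optional.of(domainId);
        } catch (Exception e) {
            LOG.log(Level.WARNING, "Could not create timeline domain "
                + domainId + "; continuing without it", e);
            return Optional.empty();
        }
    }
}
```

A caller would check the returned Optional and, when it is empty, skip attaching any domain id to the submission. That empty result is also the kind of signal the follow-on effort mentioned above could use to disable timeline operations for the AM.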



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
