[ 
https://issues.apache.org/jira/browse/YARN-5445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15426700#comment-15426700
 ] 

Daniel Templeton commented on YARN-5445:
----------------------------------------

Moving {{DFS_CLIENT_FAILOVER_MAX_ATTEMPTS_KEY}} to {{CommonConfigurationKeys}} 
doesn't sound like the right solution.  If the log aggregation code is going to 
depend on specific behavior of HDFS why shouldn't the project depend on 
hadoop-hdfs?

As you pointed out, there are other JIRAs that are attempting to resolve the 
base issue that is prompting your unusual cluster config.  I don't think 
introducing a new configuration parameter to deal with a temporary issue is a 
good idea.  Config params, like diamonds, are forever, and we already have 
entirely too many.

I haven't looked at the code for log aggregation.  What happens what the DFS 
connection fails?

> Log aggregation configured to different namenode can fail fast
> --------------------------------------------------------------
>
>                 Key: YARN-5445
>                 URL: https://issues.apache.org/jira/browse/YARN-5445
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 2.7.1
>            Reporter: Chackaravarthy
>         Attachments: YARN-5445-1.patch
>
>
> Log aggregation is enabled and configured to write applogs to different 
> cluster or different namespace (NN federation). In these cases, would like to 
> have some configs on attempts or retries to fail fast in case the other 
> cluster is completely down.
> Currently it takes default {{dfs.client.failover.max.attempts}} as 15 and 
> hence adding a latency of 2 to 2.5 mins in each container launch (per node 
> manager).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to