[jira] [Updated] (MAPREDUCE-6776) yarn.app.mapreduce.client.job.max-retries should have a more useful default

2016-10-07 Thread Robert Kanter (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-6776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Kanter updated MAPREDUCE-6776:
-
       Resolution: Fixed
     Hadoop Flags: Incompatible change, Reviewed
    Fix Version/s: 3.0.0-alpha2, 2.9.0
     Release Note: The default value of 
yarn.app.mapreduce.client.job.max-retries has been changed from 0 to 3.  This 
will help protect clients from failures that are transient.  True failures may 
take slightly longer now due to the retries.
           Status: Resolved  (was: Patch Available)
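
To put a rough number on "slightly longer": the sketch below assumes one sleep of the retry interval before each retry (2s by default, per the issue description) and ignores the time spent on the attempts themselves. The function name and the one-sleep-per-retry model are illustrative assumptions, not taken from the committed patch.

```python
def worst_case_extra_delay_s(max_retries: int, retry_interval_s: float = 2.0) -> float:
    """Upper bound on added sleep time before a persistent failure is reported.

    Assumes one sleep of retry_interval_s before each retry and ignores
    the duration of the attempts themselves.
    """
    if max_retries < 0:
        raise ValueError("max_retries must be non-negative")
    return max_retries * retry_interval_s


print(worst_case_extra_delay_s(0))  # old default of 0 retries
print(worst_case_extra_delay_s(3))  # new default of 3 retries
```

Under these assumptions, the new default adds about six seconds before a true failure reaches the client, where the old default added none.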

Thanks [~miklos.szeg...@cloudera.com] and everyone.  Committed to trunk and 
branch-2!

> yarn.app.mapreduce.client.job.max-retries should have a more useful default
> ---
>
> Key: MAPREDUCE-6776
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6776
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: client
> Affects Versions: 2.8.0
> Reporter: Daniel Templeton
> Assignee: Miklos Szegedi
> Fix For: 2.9.0, 3.0.0-alpha2
>
> Attachments: MAPREDUCE-6776.001.patch, MAPREDUCE-6776.002.patch, 
> MAPREDUCE-6776.003.patch
>
>
> The default is 0, so any communication failure results in a client failure.  
> Oozie doesn't like that.  If the RM is failing over and Oozie gets a 
> communication failure, it assumes the target job has failed.  I propose 
> raising the default to something modest like 3 or 5.  The default retry 
> interval is 2s.
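
The behaviour described above can be illustrated with a generic retry loop. This is a minimal sketch, not the Hadoop client code: `call_with_retries`, `make_flaky`, and the use of `ConnectionError` to stand in for a communication failure are all hypothetical. With `max_retries=0` the first failure goes straight to the caller. With a modest retry count, a short outage such as an RM failover can be ridden out.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def call_with_retries(
    operation: Callable[[], T],
    max_retries: int = 3,
    retry_interval_s: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run operation, retrying up to max_retries times on ConnectionError."""
    if max_retries < 0:
        raise ValueError("max_retries must be non-negative")
    last_error: Optional[ConnectionError] = None
    for attempt in range(max_retries + 1):
        if attempt > 0:
            sleep(retry_interval_s)  # wait before each retry, not before the first try
        try:
            return operation()
        except ConnectionError as err:
            last_error = err  # remember the failure and try again if attempts remain
    assert last_error is not None
    raise last_error


def make_flaky(failures: int) -> Callable[[], str]:
    """Hypothetical status call that fails a fixed number of times, then succeeds."""
    remaining = [failures]

    def get_status() -> str:
        if remaining[0] > 0:
            remaining[0] -= 1
            raise ConnectionError("RM unavailable")
        return "RUNNING"

    return get_status


def no_sleep(_seconds: float) -> None:
    """Skip real waiting in the demo."""


# Two transient failures are absorbed by three retries.
print(call_with_retries(make_flaky(2), max_retries=3, sleep=no_sleep))
```

Passing `max_retries=0` to the same call raises `ConnectionError` on the first failure, which is the situation the description says Oozie interprets as a failed job.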



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Updated] (MAPREDUCE-6776) yarn.app.mapreduce.client.job.max-retries should have a more useful default

2016-09-27 Thread Miklos Szegedi (JIRA)

Miklos Szegedi updated MAPREDUCE-6776:
--
Attachment: MAPREDUCE-6776.003.patch

Addressed the requests from the review comments.


[jira] [Updated] (MAPREDUCE-6776) yarn.app.mapreduce.client.job.max-retries should have a more useful default

2016-09-26 Thread Miklos Szegedi (JIRA)

Miklos Szegedi updated MAPREDUCE-6776:
--
Attachment: MAPREDUCE-6776.002.patch

Incorporated some style changes.


[jira] [Updated] (MAPREDUCE-6776) yarn.app.mapreduce.client.job.max-retries should have a more useful default

2016-09-21 Thread Miklos Szegedi (JIRA)

Miklos Szegedi updated MAPREDUCE-6776:
--
Status: Patch Available  (was: Open)


[jira] [Updated] (MAPREDUCE-6776) yarn.app.mapreduce.client.job.max-retries should have a more useful default

2016-09-21 Thread Miklos Szegedi (JIRA)

Miklos Szegedi updated MAPREDUCE-6776:
--
Attachment: MAPREDUCE-6776.001.patch

Set a default retry count of 3.


[jira] [Updated] (MAPREDUCE-6776) yarn.app.mapreduce.client.job.max-retries should have a more useful default

2016-09-21 Thread Haibo Chen (JIRA)

Haibo Chen updated MAPREDUCE-6776:
--
Assignee: Miklos Szegedi  (was: Daniel Templeton)


[jira] [Updated] (MAPREDUCE-6776) yarn.app.mapreduce.client.job.max-retries should have a more useful default

2016-09-12 Thread Daniel Templeton (JIRA)

Daniel Templeton updated MAPREDUCE-6776:

Description: The default is 0, so any communication failure results in a 
client failure.  Oozie doesn't like that.  If the RM is failing over and Oozie 
gets a communication failure, it assumes the target job has failed.  I propose 
raising the default to something modest like 3 or 5.  The default retry 
interval is 2s.

  (was: The default is 0, so any communication results in a client failure.  
Oozie doesn't like that.  If the RM is failing over and Oozie gets a 
communication failure, it assumes the target job has failed.  I propose 
raising the default to something modest like 3 or 5.  The default retry 
interval is 2s.)
