Julian Endres created OOZIE-2758:
------------------------------------
Summary: Improve documentation for retries
Key: OOZIE-2758
URL: https://issues.apache.org/jira/browse/OOZIE-2758
Project: Oozie
Issue Type: Bug
Components: docs
Affects Versions: 4.3.0
Reporter: Julian Endres
In the oozie-site.xml the property oozie.action.retry.interval exists.
It is described as "The interval between retries of an action in case of
failure" without specifying a time unit.
>From the propertiey oozie.service.LiteWorkflowStoreService.user.retry.inteval
>which is described as "Automatic retry interval for workflow action is in
>minutes and the default value is 10 minutes." the user could assume that the
>property oozie.action.retry.interval is also minutes. However, as in
https://github.com/apache/oozie/blob/master/core/src/main/java/org/apache/oozie/action/ActionExecutor.java
one comment states "defaultRetryInterval retry interval, in seconds.".
In our environment the standard settings are used, and the application is
exactly doint this: do a retry every 10 SECONDS (then suspend after max number
of retries e.g. in the case of a distcp action). However the user might expect
a retry every 10 minutes.
Here is an excerpt from our log:
Next Retry, Attempt Number [2] in [10.000] milliseconds
This should at least be correctly documented if not aligned to the same unit.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)