Julian Endres created OOZIE-2758:
------------------------------------

             Summary: Improve documentation for retries
                 Key: OOZIE-2758
                 URL: https://issues.apache.org/jira/browse/OOZIE-2758
             Project: Oozie
          Issue Type: Bug
          Components: docs
    Affects Versions: 4.3.0
            Reporter: Julian Endres


In the oozie-site.xml the property oozie.action.retry.interval exists. 

It is described as "The interval between retries of an action in case of 
failure" without specifying a time unit. 

>From the propertiey oozie.service.LiteWorkflowStoreService.user.retry.inteval 
>which is described as "Automatic retry interval for workflow action is in 
>minutes and the default value is 10 minutes." the user could assume that the 
>property oozie.action.retry.interval is also minutes. However, as in 

https://github.com/apache/oozie/blob/master/core/src/main/java/org/apache/oozie/action/ActionExecutor.java

one comment states "defaultRetryInterval retry interval, in seconds.". 

In our environment the standard settings are used, and the application is 
exactly doint this: do a retry every 10 SECONDS (then suspend after max number 
of retries e.g. in the case of a distcp action). However the user might expect 
a retry every 10 minutes. 

Here is an excerpt from our log: 

Next Retry, Attempt Number [2] in [10.000] milliseconds

This should at least be correctly documented if not aligned to the same unit.  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to