[ 
https://issues.apache.org/jira/browse/OOZIE-10?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alejandro Abdelnur updated OOZIE-10:
------------------------------------

    Fix Version/s: pre-Apache
    
> workflow action allow user auto retry
> -------------------------------------
>
>                 Key: OOZIE-10
>                 URL: https://issues.apache.org/jira/browse/OOZIE-10
>             Project: Oozie
>          Issue Type: New Feature
>            Reporter: Angelo K. Huang
>            Assignee: Angelo K. Huang
>              Labels: oozie, workflow
>             Fix For: pre-Apache
>
>   Original Estimate: 336h
>          Time Spent: 336h
>  Remaining Estimate: 0h
>
> Workflow action only allows transient error retry currently. User often wants 
> to control retry in each action level, such as define custom retry count for 
> each action. For a FAILED action, the possible reason could be startData or 
> endData not set or EL exception. The potential problem worth to retry is when 
> Oozie not able to get running job with a hadoop id. For a ERROR action, most 
> of errors come from job application error such as failed to parse action 
> conf, buffer overflow in ssh executor, or file not existed in fs action 
> executor.
> The solution is to define 0.3 workflow schema with new attributes in action 
> level to get user defined retry and to add default Oozie conf for system 
> level max user-retry. EX:
> workflow.xml
> <workflow-app xmlns="uri:oozie:workflow:0.3" name="test-wf">
> <action name="a" retry-max="2" retry-interval="1">
> </action>
> oozie-default.xml
>    <!-- Workflow Action Automatic Retry -->
>     <property>
>         <name>oozie.service.LiteWorkflowStoreService.user.retry.max</name>
>         <value>3</value>
>         <description>
>             Automatic retry max count for workflow action is 3 in default.
>         </description>
>     </property>
>    
>     <property>
>         
> <name>oozie.service.LiteWorkflowStoreService.user.retry.inteval</name>
>         <value>10</value>
>         <description>
>             Automatic retry interval for workflow action is in minutes 
> and the default value is 10 minutes.
>         </description>
>     </property>
>    
>     <property>
>         
> <name>oozie.service.LiteWorkflowStoreService.user.retry.error.code</name>
>         <value>
>             JA017
>         </value>
>         <description>
>             Automatic retry interval for workflow action is handled for 
> these specified error code.
>         </description>
>     </property>
>    
>     <property>
>         
> <name>oozie.service.LiteWorkflowStoreService.user.retry.error.code.ext</name>
>         <value> </value>
>         <description>
>             Automatic retry interval for workflow action is handled for 
> these specified extra error code.
>         </description>
> </property>

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to