[
https://issues.apache.org/jira/browse/OOZIE-10?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Alejandro Abdelnur updated OOZIE-10:
------------------------------------
Fix Version/s: pre-Apache
> workflow action allow user auto retry
> -------------------------------------
>
> Key: OOZIE-10
> URL: https://issues.apache.org/jira/browse/OOZIE-10
> Project: Oozie
> Issue Type: New Feature
> Reporter: Angelo K. Huang
> Assignee: Angelo K. Huang
> Labels: oozie, workflow
> Fix For: pre-Apache
>
> Original Estimate: 336h
> Time Spent: 336h
> Remaining Estimate: 0h
>
> Workflow action only allows transient error retry currently. User often wants
> to control retry in each action level, such as define custom retry count for
> each action. For a FAILED action, the possible reason could be startData or
> endData not set or EL exception. The potential problem worth to retry is when
> Oozie not able to get running job with a hadoop id. For a ERROR action, most
> of errors come from job application error such as failed to parse action
> conf, buffer overflow in ssh executor, or file not existed in fs action
> executor.
> The solution is to define 0.3 workflow schema with new attributes in action
> level to get user defined retry and to add default Oozie conf for system
> level max user-retry. EX:
> workflow.xml
> <workflow-app xmlns="uri:oozie:workflow:0.3" name="test-wf">
> <action name="a" retry-max="2" retry-interval="1">
> </action>
> oozie-default.xml
> <!-- Workflow Action Automatic Retry -->
> <property>
> <name>oozie.service.LiteWorkflowStoreService.user.retry.max</name>
> <value>3</value>
> <description>
> Automatic retry max count for workflow action is 3 in default.
> </description>
> </property>
>
> <property>
>
> <name>oozie.service.LiteWorkflowStoreService.user.retry.inteval</name>
> <value>10</value>
> <description>
> Automatic retry interval for workflow action is in minutes
> and the default value is 10 minutes.
> </description>
> </property>
>
> <property>
>
> <name>oozie.service.LiteWorkflowStoreService.user.retry.error.code</name>
> <value>
> JA017
> </value>
> <description>
> Automatic retry interval for workflow action is handled for
> these specified error code.
> </description>
> </property>
>
> <property>
>
> <name>oozie.service.LiteWorkflowStoreService.user.retry.error.code.ext</name>
> <value> </value>
> <description>
> Automatic retry interval for workflow action is handled for
> these specified extra error code.
> </description>
> </property>
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira