Author: vanto
Date: Fri Dec 28 16:47:33 2012
New Revision: 1426574
URL: http://svn.apache.org/viewvc?rev=1426574&view=rev
Log:
link fixes.
Added:
ode/site/trunk/content/extensions/activity-failure-and-recovery.mdtext
Removed:
ode/site/trunk/content/activity-failure-and-recovery.mdtext
Added: ode/site/trunk/content/extensions/activity-failure-and-recovery.mdtext
URL:
http://svn.apache.org/viewvc/ode/site/trunk/content/extensions/activity-failure-and-recovery.mdtext?rev=1426574&view=auto
==============================================================================
--- ode/site/trunk/content/extensions/activity-failure-and-recovery.mdtext
(added)
+++ ode/site/trunk/content/extensions/activity-failure-and-recovery.mdtext Fri
Dec 28 16:47:33 2012
@@ -0,0 +1,103 @@
+Title: Activity Failure and Recovery
+There are several types of error conditions. In this document we introduce a
class of error condition called _failures_, distinct from _faults_, and
describe how failures are caught and handled by the process engine.
+
+A service returns a fault in response to a request it cannot process. A
process may also raise a fault internally when it encounters a terminal error
condition, e.g. a faulty expression or false join condition. In addition,
processes may raise faults in order to terminate normal processing.
+
+In contrast, failures are non-terminal error conditions that do not affect the
normal flow of the process. We keep the process definition simple and
straightforward by delegating failure handling to the process engine and
administrator.
+
+For example, when the process is unable to perform DNS resolution to determine
the service endpoint, it generates a failure. An administrator can fix the DNS
server and tell the process engine to retry the activity. Had the DNS error
been reported as a fault, the process would either terminate or require complex
fault handling and recovery logic to proceed past this point of failure.
+
+In short, failures shields the process from common, non-terminal error
conditions while retaining simple and straightforward process definitions that
do not need to account for these error conditions.
+
+<a name="ActivityFailureandRecovery-FromFailuretoRecovery"></a>
+### From Failure to Recovery
+Currently, the _Invoke_ activity is the only activity that supports failure
handling and recovery. The mechanism is identical for all other activities that
may support failure handling and recovery in the future.
+
+In case of the _Invoke_ activity, a failure condition is triggered by the
integration layer, in lieu of a response or fault message. The _Invoke_
activity consults its failure handling policy (specified here) and decides how
to respond.
+
+Set _faultOnFailure_ to _yes_, if you want the activity to throw a fault on
failure. All other failure handling settings are ignored. The activity will
throw the _activityFailure_ fault.
+
+The _activityFailure_ fault is a standard fault, so you can use the
_exitOnStandardFault_ attribute to control whether the process exits
immediately, or throws a fault in the enclosing scope.
+
+Set _retryFor_ to a positive integer if you want the activity to attempt
self-recovery and retry up to that number of times. Set _retryDelay_ to a
reasonable time delay (specified in seconds) between retries. For example, if
you set _retryFor=2, retryDelay=30_, the activity will retry after 30 and 60
seconds, for a total of three attempts, before entering activity recovery mode.
+
+If the activity retries and succeeds, it will complete successfully as if no
failure occurred. Of course, the activity may retry and fault, e.g. if the
invoked service returns a fault. If the activity has exhausted all retry
attempts, it enters activity recovery mode. By default _retryFor_ is zero, and
the activity enters recovery mode after the first failure.
+
+When in recovery mode, you can recover the activity in one of three ways:
+* *Retry* \-\- Retry the activity manually. You can repeat this any number of
times until the activity completes or faults.
+* *Fault* \-\- Causes the activity to throw the _activityFailure_ fault.
+* *Cancel* \-\- Cancels the activity. The activity completes unsuccessfully:
without changing the state of variables, by setting the status of all its
source links to false, and without installing a compensation handler.
+Activity recovery is performed individually for each activity instance, and
does not affect other activities executing in the same process. While the
activity is in the _FAILURE_ state, the process instance remains in the
_ACTIVE_ state and may execute other activities from parallel flows and event
handlers.
+
+<a name="ActivityFailureandRecovery-SpecifyingFailureBehavior"></a>
+### Specifying Failure Behavior
+Use the _failureHandling_ extensibility element defined in the namespace
`http://ode.apache.org/activityRecovery`. The structure of the
_failureHandling_ element is:
+
+
+ <ext:failureHandling xmlns:ext="http://ode.apache.org/activityRecovery">
+ <ext:faultOnFailure> _boolean_ </ext:faultOnFailure>
+ <ext:retryFor> _integer_ </ext:retryFor>
+ <ext:retryDelay> _integer_ </ext:retryDelay>
+ </ext:failureHandling>
+
+
+The _faultOnFailure_, _retryFor_ and _retryDelay_ elements are optional. The
default values are false for _faultOnFailure_, and zero for _retryFor_ and
_retryDelay_.
+
+An activity that does not specify failure handling using this extensibility
element, inherits the failure handling policy of its parent activity,
recursively up to the top-level activity of the process. You can useinheritence
to specify the failure handling policy of a set of activities, or all
activities in the process, using a single _failureHandling_ extensibility
element.
+
+Note that due to this behavior, if activity _S_ specifies failure handling
with the values _retryFor=2, retryDelay=60_, and has a child activity _R_ that
specifies failure handling with the values _retryFor=3_, the _retryDelay_ value
for the child activity _R_ is 0, and not 60. Using the _failureHandling_
element without specifying one of its value elements will apply the default
value for that element.
+
+<a name="ActivityFailureandRecovery-Examples"></a>
+### Examples
+
+A simple invoke with the `ext:failureHandling` extension:
+
+
+ <bpel:invoke inputVariable="myRequest"
+ operation="foo" outputVariable="aResponse"
+ partnerLink="myPartner" portType="spt:SomePortType">
+
+ <ext:failureHandling
xmlns:ext="http://ode.apache.org/activityRecovery">
+ <ext:faultOnFailure>false</ext:faultOnFailure>
+ <ext:retryFor>2</ext:retryFor>
+ <ext:retryDelay>60</ext:retryDelay>
+ </ext:failureHandling>
+
+ </bpel:invoke>
+
+
+And a sequence activity that converts failures into faults:
+
+
+ <bpel:sequence>
+
+ <ext:failureHandling
xmlns:ext="http://ode.apache.org/activityRecovery">
+ <ext:faultOnFailure>true</ext:faultOnFailure>
+ </ext:failureHandling>
+
+ ...
+
+ <bpel:invoke inputVariable="myRequest"
+ operation="foo" outputVariable="aResponse"
+ partnerLink="myPartner" portType="spt:SomePortType">
+
+ <bpel:catchAll>
+ ...
+ </bpel:catchAll>
+
+ </bpel:invoke>
+
+ </bpel:sequence>
+
+
+
+<a name="ActivityFailureandRecovery-ProcessInstanceManagement"></a>
+### Process Instance Management
+The process instance management provides the following information:
+* Process instance summary includes a _failures_ element with a count of the
total number of process instances that have one or more activities in recovery
mode, and the date/time of the last activity to enter recovery mode. The
element exists if at least one activity is in recovery mode.
+* Process instance information includes a _failures_ element with a count of
the number of activities in recovery mode, and the date/time of the last
activity to enter recovery mode. The element exists if at least one activity is
in recovery mode.
+* Activity instance information includes a _failure_ element that specifies
the date/time of the failure, the reason for the failure, number of retries,
and list of available recovery actions. The element exists if the activity is
in the state _FAILURE_.
+
+Use the _recoverActivity_ operation to perform a recovery action on an
activity in recovery mode. The operation requires the process instance ID, the
activity instance ID and the recovery action to perform (one of retry, fault or
cancel).
+
+You can also determine when failure or recovery occurred for a given activity
instance from the execution log.