Hi All,
We are developing a wrapper on top of Oozie to automate rerunning
failed/killed coordinator actions.
To rerun a coordinator action, it seems I have two options:
1. Using coordinator action rerun:
oozie job -rerun <coord_Job_id> <-date XXXX>
2. Since the failed action is a workflow job, I can also rerun that
workflow job with oozie.wf.rerun.failnodes=true so that it resumes from
the failed action.
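For concreteness, here is roughly how I picture the wrapper building the
two commands. This is only a dry-run sketch that prints the command lines
instead of executing them. The function names, job IDs, date and
job.properties file are made-up placeholders, and I am assuming OOZIE_URL
is exported so that -oozie can be omitted. Passing the rerun property with
-D is also an assumption on my side; it could equally be put in the
properties file.

```shell
#!/bin/sh
# Dry-run sketch: prints the oozie commands rather than running them.
# All IDs, dates and file names below are placeholders.

# Option 1: rerun coordinator action(s) selected by nominal date.
# $1 = coordinator job ID, $2 = date or date range
coord_rerun_cmd() {
    printf 'oozie job -rerun %s -date %s\n' "$1" "$2"
}

# Option 2: rerun the workflow job behind the failed coordinator action,
# resuming from the failed nodes.
# $1 = workflow job ID, $2 = properties file for the rerun
wf_rerun_cmd() {
    printf 'oozie job -config %s -Doozie.wf.rerun.failnodes=true -rerun %s\n' "$2" "$1"
}

coord_rerun_cmd 0000001-000000000000000-oozie-C 2009-01-01T01:00Z
wf_rerun_cmd 0000002-000000000000000-oozie-W job.properties
```

The wrapper would run the printed line (or call oozie directly) once the
right option is settled.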
Questions:
0. Which option is preferred?
1. For option 1, can I choose to rerun from the failed action like the
oozie.wf.rerun.failnodes option in workflow rerun?
2. For option 1, it seems I cannot change the job configuration, whereas
option 2 gives me more flexibility. For example, I can change the job name
so that I know how many reruns have been made for that workflow. Is that
correct?
3. If I choose option 2, does that mean the rerun workflow job is no longer
part of the coordinator's actions? In other words, if I kill the
coordinator job, will the rerun workflow job keep running?
Thanks