[ 
https://issues.apache.org/jira/browse/OOZIE-1735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13950183#comment-13950183
 ] 

Rohini Palaniswamy commented on OOZIE-1735:
-------------------------------------------

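The test failure (TestEventGeneration.testCoordinatorActionEvent) is unrelated.
The RAT warning is caused by the patch not applying fully cleanly: one hunk
applied at an offset, and the leftover .orig file is flagged as unlicensed.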

patching file docs/src/site/twiki/DG_CommandLineTool.twiki
Hunk #1 succeeded at 321 (offset 3 lines).

Unapproved licenses:

  docs/src/site/twiki/DG_CommandLineTool.twiki.orig

> Support resuming of failed coordinator job and rerun of a failed coordinator 
> action
> -----------------------------------------------------------------------------------
>
>                 Key: OOZIE-1735
>                 URL: https://issues.apache.org/jira/browse/OOZIE-1735
>             Project: Oozie
>          Issue Type: Bug
>            Reporter: Purshotam Shah
>            Assignee: Purshotam Shah
>             Fix For: trunk
>
>         Attachments: OOZIE-1735-V2.patch, OOZIE-1735-V2.patch, 
> OOZIE-1735-V3.patch, OOZIE-1735_v1.patch
>
>
> We should support resuming a failed coordinator job. Jobs are set to failed 
> if there is a runtime error (such as a SQL timeout).
> Currently there is no way to recover other than running SQL.
> Resuming a failed coordinator job should also set pending to 1, reset 
> doneMaterialization, and set last modified to the current time, so that 
> materialization continues.
> We should also provide an option for resuming a failed action. The behavior 
> will be the same as the killed option.
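
As a rough illustration of the resume behavior described in the quoted issue,
here is a minimal Java sketch. The names (ResumeSketch, CoordJob, resumeFailed)
are hypothetical and are not Oozie's actual API. Moving the job back to RUNNING
and resetting doneMaterialization to false are assumptions, since the issue
does not spell them out.

```java
import java.time.Instant;

public class ResumeSketch {
    // Hypothetical subset of coordinator job states, for illustration only.
    enum Status { RUNNING, FAILED, KILLED }

    // Hypothetical stand-in for a coordinator job record; not an Oozie class.
    static class CoordJob {
        Status status;
        int pending;
        boolean doneMaterialization;
        Instant lastModified;
    }

    // Resets the fields named in the issue so materialization can continue.
    // Returns false and leaves the job untouched if it is not FAILED.
    static boolean resumeFailed(CoordJob job, Instant now) {
        if (job.status != Status.FAILED) {
            return false;
        }
        job.status = Status.RUNNING;      // assumption: target state after resume
        job.pending = 1;                  // "set pending to 1"
        job.doneMaterialization = false;  // assumption: "reset" means false
        job.lastModified = now;           // "last modified to current time"
        return true;
    }

    public static void main(String[] args) {
        CoordJob job = new CoordJob();
        job.status = Status.FAILED;
        job.doneMaterialization = true;
        boolean resumed = resumeFailed(job, Instant.now());
        System.out.println("resumed=" + resumed + " status=" + job.status
                + " pending=" + job.pending);
    }
}
```

Without a reset of this kind, the only recovery path mentioned in the issue is
editing the job record directly with SQL.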



--
This message was sent by Atlassian JIRA
(v6.2#6252)
