[ https://issues.apache.org/jira/browse/OOZIE-1735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13950183#comment-13950183 ]
Rohini Palaniswamy commented on OOZIE-1735: ------------------------------------------- Test failure unrelated (TestEventGeneration.testCoordinatorActionEvent). RAT warning is due to patch not applying fully cleanly. patching file docs/src/site/twiki/DG_CommandLineTool.twiki Hunk #1 succeeded at 321 (offset 3 lines). Unapproved licenses: docs/src/site/twiki/DG_CommandLineTool.twiki.orig > Support resuming of failed coordinator job and rerun of a failed coordinator > action > ----------------------------------------------------------------------------------- > > Key: OOZIE-1735 > URL: https://issues.apache.org/jira/browse/OOZIE-1735 > Project: Oozie > Issue Type: Bug > Reporter: Purshotam Shah > Assignee: Purshotam Shah > Fix For: trunk > > Attachments: OOZIE-1735-V2.patch, OOZIE-1735-V2.patch, > OOZIE-1735-V3.patch, OOZIE-1735_v1.patch > > > We should support resuming of failed coordinator job. Job are set to failed > if there are runtime error( like SQL timeout). > In current scenario there is no way to recover beside running SQL. > Resuming of failed coordinator job should also set pending to 1 ,reset > doneMaterialization and last modified to current time. So that > materialization continues. > We should also provide an option of resuming failed action. The behavior will > be same as killed option. -- This message was sent by Atlassian JIRA (v6.2#6252)