[ 
https://issues.apache.org/jira/browse/OOZIE-1864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Purshotam Shah updated OOZIE-1864:
----------------------------------

    Summary: Improve child job id aggregation logic  (was: Improve chid job id 
aggregation logic)

> Improve child job id aggregation logic
> --------------------------------------
>
>                 Key: OOZIE-1864
>                 URL: https://issues.apache.org/jira/browse/OOZIE-1864
>             Project: Oozie
>          Issue Type: Bug
>            Reporter: Purshotam Shah
>
> Improve chid job id aggregation logic
> Current chid job id aggregation logic
> Once launcher job complete submitting child job (jobs in case on pig), it 
> writes jobID to file.
> From Oozie server side, we collect childID in two ways
> 1. As soon as we submit launcher jobs, we check if launcher job terminated or 
> not. If it's terminated, we read child-id from file and populated to DB.  And 
> once kill command is issued we kill all child jobs.
> 2. We have a timer task (ActionCheckerService) which keeps on checking the 
> status of all running actions and if launcher job is terminated, it's update 
> the DB with childIDs.
> Jobend notification is rejected if action is not running.  
> Assume that launcher is killed after it has submitted child job.
> Child job will never be killed.
> To fix this, we should do following things.
> 1. If oozie receives job end notification and if launcher job is killed, 
> collect all child job and kill them if they are not killed.
> 2. Have a better way logic to collect child job id. Launcher job can call 
> callbackServlet ( may be periodically) to update child job ids. This could be 
> useful in pig jobs. In current  scenario we report child jobs job only when 
> launcher job completes.
>  



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to