[ 
https://issues.apache.org/jira/browse/OOZIE-2062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14207704#comment-14207704
 ] 

Shwetha G S commented on OOZIE-2062:
------------------------------------

Yes, we can delete old coord actions for long-running coordinators. But 
deleting a coord action first checks whether the corresponding workflow is 
complete, and deleting a workflow first checks whether its sub-workflows are 
complete. All these checks take time, so the overall purge takes longer. If we 
instead delete all completed workflows older than some time (without checking 
their sub-workflows) and all coord actions older than some time (without 
checking their corresponding workflows), the purge is much faster. This logic 
is also simpler to maintain. 

Both Yahoo and InMobi have implemented this logic as an external cron job. The 
idea is to add it to Oozie as an alternate purging logic.
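
To make the proposal concrete, here is a minimal sketch of age-based purging. 
It is not Oozie code: the table names (wf_jobs, coord_actions), the columns, 
the status lists, and the function purge_older_than are all assumptions made 
for illustration, and SQLite stands in for the real database. The point is 
only that each delete is a single statement filtered on status and end time, 
with no parent/child lookups.

```python
import sqlite3
from datetime import datetime, timedelta

# Assumed terminal statuses, for illustration only.
TERMINAL_WF = ("SUCCEEDED", "KILLED", "FAILED")
TERMINAL_COORD_ACTION = ("SUCCEEDED", "KILLED", "FAILED", "TIMEDOUT", "SKIPPED")


def _delete_old(conn, table, statuses, cutoff):
    """Delete finished rows whose end_time is before cutoff; return the count."""
    marks = ",".join("?" * len(statuses))
    # table is a trusted constant here, never user input.
    cur = conn.execute(
        f"DELETE FROM {table} WHERE status IN ({marks}) AND end_time < ?",
        (*statuses, cutoff),
    )
    return cur.rowcount


def purge_older_than(conn, now, older_than_days):
    """Age-based purge: no sub-workflow or parent-workflow checks."""
    cutoff = (now - timedelta(days=older_than_days)).isoformat(timespec="seconds")
    wf_deleted = _delete_old(conn, "wf_jobs", TERMINAL_WF, cutoff)
    ca_deleted = _delete_old(conn, "coord_actions", TERMINAL_COORD_ACTION, cutoff)
    conn.commit()
    return wf_deleted, ca_deleted


def make_demo_db():
    """Build an in-memory database with a hypothetical, simplified schema."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE wf_jobs (id TEXT PRIMARY KEY, parent_id TEXT, "
        "status TEXT, end_time TEXT)"
    )
    conn.execute(
        "CREATE TABLE coord_actions (id TEXT PRIMARY KEY, wf_id TEXT, "
        "status TEXT, end_time TEXT)"
    )
    conn.executemany(
        "INSERT INTO wf_jobs VALUES (?, ?, ?, ?)",
        [
            ("wf-1", None, "SUCCEEDED", "2014-10-01T00:00:00"),
            ("wf-2", None, "SUCCEEDED", "2014-11-10T00:00:00"),
            ("wf-3", None, "RUNNING", None),
            # Finished sub-workflow of a still-running parent.
            ("wf-4", "wf-3", "KILLED", "2014-09-01T00:00:00"),
        ],
    )
    conn.executemany(
        "INSERT INTO coord_actions VALUES (?, ?, ?, ?)",
        [
            ("ca-1", "wf-1", "SUCCEEDED", "2014-10-01T00:00:00"),
            ("ca-2", "wf-3", "RUNNING", None),
            ("ca-3", "wf-2", "SUCCEEDED", "2014-11-11T00:00:00"),
        ],
    )
    conn.commit()
    return conn
```

In this sketch, calling purge_older_than(make_demo_db(), datetime(2014, 11, 12), 30) 
removes wf-4 even though its parent wf-3 is still running, which is exactly 
the trade-off being proposed: a faster and simpler purge in exchange for 
dropping the completeness checks.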


> Provide alternate purging logic
> -------------------------------
>
>                 Key: OOZIE-2062
>                 URL: https://issues.apache.org/jira/browse/OOZIE-2062
>             Project: Oozie
>          Issue Type: Improvement
>            Reporter: Shwetha G S
>            Assignee: Jaydeep Vishwakarma
>
> Currently, the purge service deletes a workflow only if all its children are 
> complete, and a coord action only if the corresponding workflow and its 
> sub-workflows are complete. This logic is complicated and takes longer to 
> run. Instead, provide an alternative purging logic that deletes all workflows 
> whose end time is older than a configured time, deletes coord actions whose 
> end time is older than a configured time, and so on. The purging logic to be 
> used can be configured in oozie-site (defaulting to the current logic for 
> backward compatibility).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
