[ 
https://issues.apache.org/jira/browse/FALCON-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14909680#comment-14909680
 ] 

Ajay Yadava commented on FALCON-965:
------------------------------------

{quote}
We don't want users to be building co-ords, bundles. It can be more like, the 
builder returns (or points) to the user-workflow to execute (and properties) 
for a given policy.
{quote}
Umm..I think there is some disconnect and you got the wrong impression of the 
requirements, let me clarify. 
Lifecycle doesn't require users to build bundles. 
Scheduling parameters like validity, frequency etc. will be governed by 
lifecycle, hence policies will need to guide scheduler. It is unavoidable 
irrespective of how it is built, natively inside falcon or by extension. 

Also, native scheduler is not a "workflow engine" it is just a friendly 
neighborhood "scheduler"! A scheduler takes care of the scheduling parameters 
and workflow engine does the actual execution. 

For scheduler we will have inbuilt support for - native scheduler and oozie 
scheduler(through coordinators).

Workflow Engine, is pluggable and we have native support for oozie workflow 
engine.


Currently falcon code has a very tight coupling between the two as the 
EntityBuilder builds bundles which in turn builds the coordinators and which in 
turn builds the workflow engine. EntityBuilders deal only at bundle level. I 
think this needs to change now, with the advent of lifecycle and in future, 
native scheduler. I have already created a JIRA for this FALCON-1448 for 
decoupling this and other clean up tasks. I am just waiting for native 
scheduler to be committed.

I have also created a JIRA FALCON-1478 to enable lifecycle through native 
scheduler and assigned it to myself, will take care of it on priority after at 
least the base framework gets committed. 

{quote}
Will the retention element be used as fallback, in case, lifecycle fails 
validation?
{quote}
No, there are no fallbacks. It will be very counter intuitive for the user 
where he wanted to specify retention through lifecycle and we silently fell 
back to original retention. I have documented this behavior in the docs as well.

{quote}
Since only retention is going in now, replication will be specified in the old 
way. In this case, the OozieWorkflowEngine will process the feed and generate 
appropriate bundles. If retention is specified too, how does it know to ignore 
it?
{quote}
If retention is defined through lifecycle, retention tag is ignored and we 
build retention coordinator through lifecycle. Perhaps you missed the change, 
please refer to FeedBundleBuilder class.


> Open up life cycle stage implementation within Falcon for extension
> -------------------------------------------------------------------
>
>                 Key: FALCON-965
>                 URL: https://issues.apache.org/jira/browse/FALCON-965
>             Project: Falcon
>          Issue Type: New Feature
>    Affects Versions: 0.7
>            Reporter: Srikanth Sundarrajan
>            Assignee: Ajay Yadava
>              Labels: recipes
>             Fix For: 0.8
>
>         Attachments: FALCON-965-v1.patch, FALCON-965-v2.patch, 
> FALCON-965.patch, FalconLifecycle-Designdoc.pdf, xsd.patch
>
>
> As it stands Falcon supports replication, generation and eviction lifecycle 
> stages and plans to support more. This however assumes a certain way of 
> implementing a life cycle function and changes to these implementation aren't 
> easy, as they are not open for easy extension. This proposed feature is open 
> this up in Falcon.
> Here is a proposal on how things can possibly be:
> * List of life cycles that Falcon supports would be well known and not 
> extensible
> * Dependency between life cycles are coded up in the falcon server and not 
> necessarily extensible. (In short adding a new life cycle still requires 
> changes in Falcon)
> * Each Lifecycle in falcon advertises an implementation interface and minimum 
> configuration interface (for ex. Eviction should expose a way to retrieve the 
> configured time limit for which data will be available for other life cycle 
> stages to validate. There is no point in having a process consume last 24 
> instances of a feed, when the retention will retain only 4 instances)
> * Similar to FALCON-634, life cycle implementation can be dropped in as long 
> as the implementation interface and configuraion interfaces are adhered to.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to