[ 
https://issues.apache.org/jira/browse/FALCON-325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14094925#comment-14094925
 ] 

Sowmya Ramesh commented on FALCON-325:
--------------------------------------

On the source cluster there already exists a relationship between Feed entity, 
Source cluster and Feed Instance as shown below

FeedEntity <-(instance-of) FeedInstance (stored-in) -> Srccluster

When replication policy is run; edge with relationship label replicated-to has 
to be added between FeedInstance and TargetCluster. 
Also FeedInstance vertex property has to be updated with timestamp of 
replication. 

To improve the performance for the retrieval query : say get me all the 
replicated instances for past 3 months
Add new REPLICATED property to FeedInstance vertex which was replicated. This 
would avoid going and looking for out edge with label replicated-to to get the 
replicated instances.

Please comment if you have any concerns about this approach. Thanks!       

> Process lineage information for Replication policies
> ----------------------------------------------------
>
>                 Key: FALCON-325
>                 URL: https://issues.apache.org/jira/browse/FALCON-325
>             Project: Falcon
>          Issue Type: Sub-task
>    Affects Versions: 0.5
>            Reporter: Venkatesh Seetharam
>            Assignee: Sowmya Ramesh
>              Labels: lineage
>
> The current patch only addresses process executions and not data lifecycle 
> policies. This task should address adding this information.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to