[
https://issues.apache.org/jira/browse/FALCON-325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14094925#comment-14094925
]
Sowmya Ramesh commented on FALCON-325:
--------------------------------------
On the source cluster there already exists a relationship between Feed entity,
Source cluster and Feed Instance as shown below
FeedEntity <-(instance-of) FeedInstance (stored-in) -> Srccluster
When replication policy is run; edge with relationship label replicated-to has
to be added between FeedInstance and TargetCluster.
Also FeedInstance vertex property has to be updated with timestamp of
replication.
To improve the performance for the retrieval query : say get me all the
replicated instances for past 3 months
Add new REPLICATED property to FeedInstance vertex which was replicated. This
would avoid going and looking for out edge with label replicated-to to get the
replicated instances.
Please comment if you have any concerns about this approach. Thanks!
> Process lineage information for Replication policies
> ----------------------------------------------------
>
> Key: FALCON-325
> URL: https://issues.apache.org/jira/browse/FALCON-325
> Project: Falcon
> Issue Type: Sub-task
> Affects Versions: 0.5
> Reporter: Venkatesh Seetharam
> Assignee: Sowmya Ramesh
> Labels: lineage
>
> The current patch only addresses process executions and not data lifecycle
> policies. This task should address adding this information.
--
This message was sent by Atlassian JIRA
(v6.2#6252)