[
https://issues.apache.org/jira/browse/FALCON-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15085035#comment-15085035
]
pavan kumar kolamuri commented on FALCON-1728:
----------------------------------------------
[~bvellanki] In inmobi, we have to run same process across multiple clusters
and input data will be different across clusters and we will write data to
output feeds. There wont be any corruption in data since in each cluster
process will write to its cluster output path. Output data will be different in
each clusters even though path is same. I hope this use case is very common if
users have multiple clusters
> Process entity definition allows multiple clusters when it has output Feed
> defined.
> ------------------------------------------------------------------------------------
>
> Key: FALCON-1728
> URL: https://issues.apache.org/jira/browse/FALCON-1728
> Project: Falcon
> Issue Type: Bug
> Components: process
> Affects Versions: 0.9
> Reporter: Balu Vellanki
> Assignee: Balu Vellanki
> Priority: Critical
>
> Process XSD allows user to specify multiple clusters per process entity. I am
> guessing this would allow a user to run duplicate instance of the process on
> multiple clusters at the same time (I do not really see a need for this).
> When the process has an output feed defined, you can have duplicate process
> instances writing to same feed instance, causing data corruption/failures.
> The solution is to
> 1. Do not allow multiple clusters per process. Let the user define a
> duplicate process if user wants to run duplicate instances.
> OR
> 2. Allow multiple clusters, but only when there is no output feed defined.
> [~sriksun] please let me know if there is any other reason for allowing
> multiple clusters in a process.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)