On configuring two source clusters due to colo requirement

John Yu Mon, 14 Jul 2014 11:24:03 -0700

Hey all,

We currently have the following use case:
Colo1 has 1 ETL cluster (Colo1-ETL) and 1 adhoc cluster (Colo1-A)
Colo2 has 1 ETL cluster (Colo2-ETL) and 1 adhoc cluster (Colo2-A)


Due to the bandwidth constraint between the two colo's, we are thinking of
having the 2 ETL clusters perform the same computation to generate the same
dataset, and have the 2 adhoc clusters pull from their respective
colo-local ETL cluster.

What would be a good way to configure this feed?

I've tried the following:
<cluster name=“colo1ETL type="source">
<cluster name="colo2ETL" type="source">
<cluster name=“colo1A” type="target">
<cluster name="colo2A” type="target">
Error: Partition expression has to be specified for cluster colo1ETL as
there are more than one source clusters

<cluster name=“colo1ETL”>
<cluster name="colo2ETL”>
<cluster name=“colo1A” type="target">
<cluster name="colo2A” type="target">
Error: Feed: pve-intermediate should have atleast one source cluster defined


Thanks!

John

On configuring two source clusters due to colo requirement

Reply via email to