Hey all, We currently have the following use case: Colo1 has 1 ETL cluster (Colo1-ETL) and 1 adhoc cluster (Colo1-A) Colo2 has 1 ETL cluster (Colo2-ETL) and 1 adhoc cluster (Colo2-A)
Due to the bandwidth constraint between the two colo's, we are thinking of having the 2 ETL clusters perform the same computation to generate the same dataset, and have the 2 adhoc clusters pull from their respective colo-local ETL cluster. What would be a good way to configure this feed? I've tried the following: <cluster name=“colo1ETL type="source"> <cluster name="colo2ETL" type="source"> <cluster name=“colo1A” type="target"> <cluster name="colo2A” type="target"> Error: Partition expression has to be specified for cluster colo1ETL as there are more than one source clusters <cluster name=“colo1ETL”> <cluster name="colo2ETL”> <cluster name=“colo1A” type="target"> <cluster name="colo2A” type="target"> Error: Feed: pve-intermediate should have atleast one source cluster defined Thanks! John
