Here is a problem:

My customer has a 45Megabit connection to their off-site DR data center. They have about 500G worth of data. That connection is shared. Needless to say this is not an optimal configuration.

To replicate all that in real time it'll take a week.

My primary cluster is 4 nodes, RF=2. DR cluster is also 4 nodes, RF=2.

I need a way to somehow setup the primary cluster, populate all the data, then transfer it to the DR cluster.



One suggestion is:

1) Setup the primary cluster, plus configure a Mac Mini as a backup data center but on the same network
2) Populate the data
3) Physically take Mac Mini to the DR data center and transfer its data to one of the nodes and then run nodetool cleanup to move the data around amongs nodes.

Now… this doesn't strike me as optimal. I feel like I'll need to run repair on the new cluster, which defeats the purpose -- it'll just hog the 45Megabit pipe…

Somehow I need away to load all the data into primary cluster, then ship it over to the backup in a more timely fashion…

Any suggestions are greatly appreciated.

Also,  I need a way to know if the replication is up to date or not.

--
Regards,
Oleg Dulin
http://www.olegdulin.com


Reply via email to