The keyspace has:
WITH replication = {'class': 'NetworkTopologyStrategy', 'DC1': '3',
'us-east-productiondata': '3'} AND durable_writes = true;
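One thing worth ruling out with a replication map like this is a datacenter name that does not match what the snitch reports, since NetworkTopologyStrategy places replicas only in datacenters named exactly (case-sensitively) in the map. Below is a minimal Python sketch of that comparison. The set `snitch_dcs` is a hypothetical stand-in for the "Datacenter:" names copied by hand from `nodetool status`; nothing here talks to a live cluster.

```python
import ast

def replication_map(cql_options: str) -> dict:
    """Parse a CQL replication map literal (single-quoted strings) into a dict."""
    return ast.literal_eval(cql_options)

# Replication options as pasted in this thread.
opts = replication_map(
    "{'class': 'NetworkTopologyStrategy', 'DC1': '3', "
    "'us-east-productiondata': '3'}"
)

# Every key except 'class' is a datacenter name mapped to its replication factor.
rf_by_dc = {dc: int(rf) for dc, rf in opts.items() if dc != "class"}

# Hypothetical: datacenter names as printed by `nodetool status`, entered by hand.
snitch_dcs = {"DC1", "us-east-productiondata"}

# Names in the keyspace that no node reports, and datacenters with no replicas.
unknown_in_keyspace = set(rf_by_dc) - snitch_dcs
missing_from_keyspace = snitch_dcs - set(rf_by_dc)

print("RF per DC:", rf_by_dc)
print("In keyspace but not reported by snitch:", unknown_in_keyspace)
print("Reported by snitch but not in keyspace:", missing_from_keyspace)
```

If both sets come back empty and the replication factors are equal, the map itself is probably not the cause of the size difference.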
From: Jeff Jirsa <[email protected]>
Reply-To: "[email protected]" <[email protected]>
Date: Monday, August 7, 2017 at 2:51 PM
To: cassandra <[email protected]>
Subject: Re: Different data size between datacenters
And when you say the data size is smaller, you mean per node? Or sum of all
nodes in the datacenter?
With 185 hosts in AWS vs 135 in your DC, and the same replication factor in
both, I would expect your AWS hosts to have roughly 27% less data per host
than the hosts in your DC (135/185 is about 0.73).
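The per-host expectation can be checked with a quick calculation. This is only a sketch, assuming both datacenters hold the same replication factor and that data is spread evenly across the hosts within each datacenter; the labels `own_dc` and `aws` are illustrative, not the real datacenter names.

```python
def expected_per_host_share(hosts_by_dc: dict) -> dict:
    """Fraction of one datacenter's full replica set that each host should hold.

    Assumes the same replication factor in every datacenter and an even
    spread of data across the hosts within a datacenter.
    """
    return {dc: 1.0 / hosts for dc, hosts in hosts_by_dc.items()}

# Host counts from this thread; the labels are illustrative.
hosts = {"own_dc": 135, "aws": 185}

share = expected_per_host_share(hosts)

# Per-host data in AWS relative to the on-premises datacenter (135 / 185).
ratio = share["aws"] / share["own_dc"]
print(f"AWS per-host data relative to own DC: {ratio:.2f}")
```

A measured ratio far from this value (for example, AWS hosts holding half as much as the others) would point away from simple host-count differences.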
If instead they have twice as much, it sounds like it's balancing by # of
tokens instead, which may be an indication that you're somehow using
SimpleStrategy, or your NetworkTopologyStrategy is somehow misconfigured for
one or more keyspaces.
Can you paste your keyspace replication strategy lines, anonymized as needed?
On Mon, Aug 7, 2017 at 1:46 PM, Chuck Reynolds
<[email protected]> wrote:
Yes to the NetworkTopologyStrategy.
From: Jeff Jirsa <[email protected]>
Reply-To: "[email protected]" <[email protected]>
Date: Monday, August 7, 2017 at 2:39 PM
To: cassandra <[email protected]>
Subject: Re: Different data size between datacenters
You're using NetworkTopologyStrategy and not SimpleStrategy, correct?
On Mon, Aug 7, 2017 at 11:50 AM, Chuck Reynolds
<[email protected]> wrote:
I have a cluster running Cassandra 2.1.12 that spans two datacenters, with
135 nodes in my own datacenter and about 185 in AWS.
The data size of the second datacenter (AWS) is quite a bit smaller, even
though replication is the same in both datacenters. Is there a logical
explanation for this?
Thanks