Hi, We are using Cassandra 3.0.13 We have the following datacenters:
- DC1 with 7 Cassandra nodes with RF:3 (2 years old) - DC2 with 1 Cassandra node with RF:1 (4 years old) - DC3 with 2 Cassandra nodes with RF:2 (one-month-old) On DC2 and DC3, each node has 100% data. Seed nodes while setting up new-datacenter DC3 were: - 1 DC2 node - 1 DC1 node We are planning to remove the 4-year-old data center (DC2). We are seeing a discrepancy (around ~250GB) in the load of the nodes in DC2 and DC3 via the "nodetool status" command. Datacenter: DC2 > ================ > Status=Up/Down > |/ State=Normal/Leaving/Joining/Moving > -- Address Load Tokens Owns (effective) Host ID > Rack > UN 2.2.2.2 747.57 GB 256 100.0% efgh > RAC1 > > Datacenter: DC3 > ================ > Status=Up/Down > |/ State=Normal/Leaving/Joining/Moving > -- Address Load Tokens Owns (effective) Host ID > Rack > UN 3.3.3.3 504.57 GB 256 100.0% ijkl > RAC1 > UN 4.4.4.4 502.17 GB 256 100.0% mnop > RAC1 > > What could be the possible reasons for the 250gb data discrepancy? Also, we run a repair on every weekend. Thank you!