Hi,
We are using Cassandra 3.0.13
We have the following datacenters:
- DC1 with 7 Cassandra nodes with RF:3 (2 years old)
- DC2 with 1 Cassandra node with RF:1 (4 years old)
- DC3 with 2 Cassandra nodes with RF:2 (one-month-old)
On DC2 and DC3, each node has 100% data.
Seed nodes while setting up new-datacenter DC3 were:
- 1 DC2 node
- 1 DC1 node
We are planning to remove the 4-year-old data center (DC2).
We are seeing a discrepancy (around ~250GB) in the load of the nodes in DC2
and DC3 via the "nodetool status" command.
Datacenter: DC2
> ================
> Status=Up/Down
> |/ State=Normal/Leaving/Joining/Moving
> -- Address Load Tokens Owns (effective) Host ID
> Rack
> UN 2.2.2.2 747.57 GB 256 100.0% efgh
> RAC1
>
> Datacenter: DC3
> ================
> Status=Up/Down
> |/ State=Normal/Leaving/Joining/Moving
> -- Address Load Tokens Owns (effective) Host ID
> Rack
> UN 3.3.3.3 504.57 GB 256 100.0% ijkl
> RAC1
> UN 4.4.4.4 502.17 GB 256 100.0% mnop
> RAC1
>
>
What could be the possible reasons for the 250gb data discrepancy?
Also, we run a repair on every weekend.
Thank you!