Load discrepancy between old vs new nodes of Cassandra

Saijal Chauhan Mon, 17 Aug 2020 05:00:38 -0700

Hi,

We are using Cassandra 3.0.13
We have the following datacenters:


   - DC1 with 7 Cassandra nodes with RF:3 (2 years old)
   - DC2 with 1 Cassandra node with RF:1 (4 years old)
   - DC3 with 2 Cassandra nodes with RF:2 (one-month-old)

On DC2 and DC3, each node has 100% data.
Seed nodes while setting up new-datacenter DC3 were:

   -  1 DC2 node
   -  1 DC1 node



We are planning to remove the 4-year-old data center (DC2).
We are seeing a discrepancy (around ~250GB) in the load of the nodes in DC2
and DC3 via the "nodetool status" command.

    Datacenter: DC2
>     ================
>     Status=Up/Down
>     |/ State=Normal/Leaving/Joining/Moving
>     --  Address          Load       Tokens       Owns (effective)  Host ID
>            Rack
>     UN  2.2.2.2          747.57 GB  256          100.0%            efgh
>             RAC1
>
>     Datacenter: DC3
>     ================
>     Status=Up/Down
>     |/ State=Normal/Leaving/Joining/Moving
>     --  Address          Load       Tokens       Owns (effective)  Host ID
>            Rack
>     UN  3.3.3.3          504.57 GB  256          100.0%            ijkl
>                RAC1
>     UN  4.4.4.4          502.17 GB  256          100.0%            mnop
>             RAC1
>

>
What could be the possible reasons for the 250gb data discrepancy?
Also, we run a repair on every weekend.

Thank you!

Load discrepancy between old vs new nodes of Cassandra

Reply via email to