As Kurt mentioned, you definitely need to pick a partition key that ensure
data is uniformly distributed.
If you want to want to redistribute the data in cluster and move tokens
around, you could decommission the node with the tokens you want to
redistribute and then bootstrap a new node into the
You should choose a partition key that enables you to have a uniform
distribution of partitions amongst the nodes and refrain from having too
many wide rows/a small number of wide partitions. If your tokens are
already uniformly distributed, recalculating in order to achieve a better
data load
Suppose I have a cassandra cluster with the data that is skewed such that
one node have 40% more data than other nodes.Since while creating the
cassandra the tokens were distributed uniformly.
Now to make the data uniform I have to recalculate the tokens and assign
them to nodes in the cluster.
see that some node(s) are using more disk space than others if some
> point of time especially if you do updates and not just inserts.
>
> Cheers,
> Hannu
>
> On 13 September 2017 at 07:47:09, Akshit Jain (akshit13...@iiitd.ac.in)
> wrote:
>
> Hi,
> Can a cassandra clus
others if some point of
time especially if you do updates and not just inserts.
Cheers,
Hannu
On 13 September 2017 at 07:47:09, Akshit Jain (akshit13...@iiitd.ac.in)
wrote:
Hi,
Can a cassandra cluster be unbalanced in terms of data?
If yes then how to rebalance a cassandra cluster.
.ac.in]
Sent: Tuesday, September 12, 2017 9:47 PM
To: user@cassandra.apache.org
Subject: Rebalance a cassandra cluster
Hi,
Can a cassandra cluster be unbalanced in terms of data?
If yes then how to rebalance a cassandra cluster.
Hi,
Can a cassandra cluster be unbalanced in terms of data?
If yes then how to rebalance a cassandra cluster.