Which variables (for instance: throughput, CPU, I/O, connections) are
leading in deciding to add a node to a Cassandra setup which is put
under strain. We are trying to proove scalibility, but when is the time
there to add a node and have the optimum scalibilty result.
It's a difficult questions to answer in the abstract. Some thoughts...
Scaling by adding one node at time is not optimal. The best case scenario is
to double the number of nodes, as this means existing nodes only have to stream
their data to a new node. Obviously this is not always possible.