If I understand correctly this is controlled by setting `auto_bootstrap`. If it is set to true (the default), once the node joins the cluster it will have some portion of the data assigned to it, and its data will be streamed to it from the other nodes. Once the data has finished streaming only then will this node start to answer queries. So to answer your question,
> Or is new nodes not used until all data is streamed? Yes, by default. You probably do not want to set `auto_bootstrap` to false. In fact, it is "hidden" in `cassandra.yaml` ( https://issues.apache.org/jira/browse/CASSANDRA-2447). To see why you do not want to set it to false there are a couple of nice articles: https://monzo.com/blog/2019/09/08/why-monzo-wasnt-working-on-july-29th https://thelastpickle.com/blog/2017/05/23/auto-bootstrapping-part1.html On Tue, Oct 22, 2019 at 10:54 AM Eunsu Kim <eunsu.bil...@gmail.com> wrote: > Hi experts, > > When a new node was added, how can the coordinator find data that has been > not yet streamed? > > Or is new nodes not used until all data is streamed? > > Thanks in advance > --------------------------------------------------------------------- > To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org > For additional commands, e-mail: user-h...@cassandra.apache.org > >