The "Operations" page has been changed by JonathanEllis:
http://wiki.apache.org/cassandra/Operations?action=diff&rev1=106&rev2=107

Comment:
cleanup

  
   1. You should wait long enough for all the nodes in your cluster to become 
aware of the bootstrapping node via gossip before starting another bootstrap.  
The new node will log "Bootstrapping" when this is safe, 2 minutes after 
starting.  (90s to make sure it has accurate load information, and 30s waiting 
for other nodes to start sending it inserts happening in its to-be-assumed part 
of the token ring.)
   1. Relating to point 1, one can only bootstrap N nodes at a time with 
automatic non-vnode token picking, where N is the size of the existing cluster. 
To more than double the size of your cluster, wait for the first N nodes to 
finish (so that the cluster is size 2N) before bootstrapping more nodes. So if 
your current cluster is 5 nodes and you want to add 7 nodes, bootstrap 5 and 
let those finish before bootstrapping the last two.
-  1. As a safety measure, Cassandra does not automatically remove data from 
nodes that "lose" part of their Token Range to a newly added node.  Run 
`nodetool cleanup` on the source node(s) (neighboring nodes that shared the 
same subrange) when you are satisfied the new node is up and working. If you do 
not do this the old data will still be counted against the load on that node 
and future bootstrap attempts at choosing a location will be thrown off.
+  1. As a safety measure, Cassandra does not automatically remove data from 
nodes that "lose" part of their Token Range to a newly added node.  Run 
`nodetool cleanup` on the source node(s) (neighboring nodes that shared the 
same subrange) when you are satisfied the new node is up and working. If you do 
not do this the old data will still be counted against the load on that node.
-  1. During bootstrap, a node will not bind the Thrift port until finished.
-  1. Bootstrap can take many hours when a lot of data is involved.  See 
[[Streaming]] for how to monitor progress.
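
The batching rule in the second list item (bootstrap at most N nodes at a time, where N is the current cluster size) can be sketched as a small planner. This is only an illustration under stated assumptions: the function name `plan_bootstrap_batches` is hypothetical and not part of Cassandra, and it only does the arithmetic; it does not start or monitor any node.

```python
def plan_bootstrap_batches(existing: int, to_add: int) -> list[int]:
    """Split `to_add` new nodes into batches, each no larger than the
    cluster size at the time the batch starts (hypothetical helper)."""
    if existing < 1:
        raise ValueError("existing cluster must have at least one node")
    if to_add < 0:
        raise ValueError("to_add must not be negative")

    batches = []
    size = existing        # current cluster size, N
    remaining = to_add     # nodes still waiting to be bootstrapped
    while remaining > 0:
        batch = min(size, remaining)  # at most N nodes per round
        batches.append(batch)
        size += batch                 # the cluster grows once the batch finishes
        remaining -= batch
    return batches


# The example from the text: a 5-node cluster gaining 7 nodes.
print(plan_bootstrap_batches(5, 7))  # [5, 2]
```

Each batch must finish bootstrapping before the next one starts, which is why the cluster size is updated only after a batch is counted.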
  
  Cassandra is smart enough to transfer data from the nearest source node(s), 
if your !EndpointSnitch is configured correctly.  So, the new node doesn't need 
to be in the same datacenter as the primary replica for the Range it is 
bootstrapping into, as long as another replica is in the datacenter with the 
new one.
  
  Bootstrap progress can be monitored using `nodetool` with the `netstats` 
argument.
  
- During bootstrap `nodetool` may report that the new node is neither receiving 
nor sending any streams, in which case it may be building secondary indexes, 
visible in `compactionstats`
+ During bootstrap `nodetool` may report that the new node is neither receiving 
nor sending any streams, in which case it may be building secondary indexes, 
visible in `compactionstats`.
  
  == Moving or Removing nodes ==
  === Removing nodes entirely ===
