about validity of recipe A node join using external data copy methods

2013-01-08 Thread DE VITO Dominique
Hi,

Edward Capriolo described in his Cassandra book a faster way [1] to start new 
nodes if the cluster size doubles, from N to 2 *N.

It's about splitting in 2 parts each token range taken in charge, after the 
split, with 2 nodes: the existing one, and a new one. And for starting a new 
node, one needs to:
- copy the data records from the corresponding node (without the system 
records)
- start the new node with auto_bootstrap: false

This raises 2 questions:

A) is this recipe still valid with v1.1 and v1.2 ?

B) do we still need to start the new node with auto_bootstrap: false ?
My guess is yes as the happening of the bootstrap phase is not recorded into 
the data records.

Thanks.

Dominique

[1] see recipe A node join using external data copy methods, page 165


Re: about validity of recipe A node join using external data copy methods

2013-01-08 Thread Edward Capriolo
Basically this recipe is from the old days when we had anti-compaction. Now
streaming is very efficient rarely fails and there is no need to do it this
way anymore. This recipe will be abolished from the second edition. It
still likely works except when using counters.

Edward

On Tue, Jan 8, 2013 at 7:27 AM, DE VITO Dominique 
dominique.dev...@thalesgroup.com wrote:

   Hi,



 Edward Capriolo described in his Cassandra book a faster way [1] to start
 new nodes if the cluster size doubles, from N to 2 *N.



 It's about splitting in 2 parts each token range taken in charge, after
 the split, with 2 nodes: the existing one, and a new one. And for starting
 a new node, one needs to:

 - copy the data records from the corresponding node (without the system
 records)

 - start the new node with auto_bootstrap: false



 This raises 2 questions:



 A) is this recipe still valid with v1.1 and v1.2 ?



 B) do we still need to start the new node with auto_bootstrap: false ?

 My guess is yes as the happening of the bootstrap phase is not recorded
 into the data records.



 Thanks.



 Dominique



 [1] see recipe A node join using external data copy methods, page 165



RE: about validity of recipe A node join using external data copy methods

2013-01-08 Thread DE VITO Dominique
 Now streaming is very efficient rarely fails and there is no need to do it 
this way anymore

I guess it's true in v1.2.
Is it true also in v1.1 ?

Thanks.

Dominique


De : Edward Capriolo [mailto:edlinuxg...@gmail.com]
Envoyé : mardi 8 janvier 2013 16:01
À : user@cassandra.apache.org
Objet : Re: about validity of recipe A node join using external data copy 
methods

Basically this recipe is from the old days when we had anti-compaction. Now 
streaming is very efficient rarely fails and there is no need to do it this way 
anymore. This recipe will be abolished from the second edition. It still likely 
works except when using counters.

Edward

On Tue, Jan 8, 2013 at 7:27 AM, DE VITO Dominique 
dominique.dev...@thalesgroup.commailto:dominique.dev...@thalesgroup.com 
wrote:
Hi,

Edward Capriolo described in his Cassandra book a faster way [1] to start new 
nodes if the cluster size doubles, from N to 2 *N.

It's about splitting in 2 parts each token range taken in charge, after the 
split, with 2 nodes: the existing one, and a new one. And for starting a new 
node, one needs to:
- copy the data records from the corresponding node (without the system 
records)
- start the new node with auto_bootstrap: false

This raises 2 questions:

A) is this recipe still valid with v1.1 and v1.2 ?

B) do we still need to start the new node with auto_bootstrap: false ?
My guess is yes as the happening of the bootstrap phase is not recorded into 
the data records.

Thanks.

Dominique

[1] see recipe A node join using external data copy methods, page 165



Re: about validity of recipe A node join using external data copy methods

2013-01-08 Thread Edward Capriolo
It has been true since about 0.8. in the old days ANTI-COMPACTION stunk and
many weird errors would cause node joins to have to be retried N times.

Now node moves/joins seem to work near 100% of the time (in 1.0.7) they are
also very fast and efficient.

If you want to move a node to new hardware you can do it with rsync, but I
would not use the technique for growing the cluster. It is error prone, and
ends up being more work.

On Tue, Jan 8, 2013 at 10:57 AM, DE VITO Dominique 
dominique.dev...@thalesgroup.com wrote:

Now streaming is very efficient rarely fails and there is no need to
 do it this way anymore



 I guess it's true in v1.2.

 Is it true also in v1.1 ?



 Thanks.



 Dominique





 *De :* Edward Capriolo [mailto:edlinuxg...@gmail.com]
 *Envoyé :* mardi 8 janvier 2013 16:01
 *À :* user@cassandra.apache.org
 *Objet :* Re: about validity of recipe A node join using external data
 copy methods



 Basically this recipe is from the old days when we had anti-compaction.
 Now streaming is very efficient rarely fails and there is no need to do it
 this way anymore. This recipe will be abolished from the second edition. It
 still likely works except when using counters.



 Edward



 On Tue, Jan 8, 2013 at 7:27 AM, DE VITO Dominique 
 dominique.dev...@thalesgroup.com wrote:

 Hi,



 Edward Capriolo described in his Cassandra book a faster way [1] to start
 new nodes if the cluster size doubles, from N to 2 *N.



 It's about splitting in 2 parts each token range taken in charge, after
 the split, with 2 nodes: the existing one, and a new one. And for starting
 a new node, one needs to:

 - copy the data records from the corresponding node (without the system
 records)

 - start the new node with auto_bootstrap: false



 This raises 2 questions:



 A) is this recipe still valid with v1.1 and v1.2 ?



 B) do we still need to start the new node with auto_bootstrap: false ?

 My guess is yes as the happening of the bootstrap phase is not recorded
 into the data records.



 Thanks.



 Dominique



 [1] see recipe A node join using external data copy methods, page 165