On May 3, 2011, at 12:35 PM, Jeff Pollard wrote: > Grant, > > Thanks for the reply. I think I understand what you're saying, but it's not > 100% clear to me how exactly it applies to my question. I'll try explaining > my plan another way if that helps. > > Imagine we have 4 nodes running in a ring on EC2: [email protected], > [email protected], [email protected], [email protected]. Then one day node [email protected] > goes down and is unrecoverable. Our plan had been: > > # On a existing node in the cluster > riak-admin remove [email protected] > riak-admin ringready > # .. wait for TRUE > > # Boot a new EC2 machine configured as [email protected] (brand new IP) > # Then on that new machine... > riak start > riak-admin join [email protected] > riak-admin ringready > # .. wait for TRUE > > In that scenario, I imagine the issue is that removing and adding a new node > under a new node name is putting extra strain on the cluster to shift data > around? Simply replacing a new node under the same hostname (even though it > has a different IP) would mean less work for the cluster? And how much extra > work are >
You're correct in assuming that there is extra strain on the cluster moving that data around, but the amount of that churn will depend on how much data you have in your Riak cluster and your use case. Removing and adding a node is a great way to replace a broken node, but if you have a large amount of data it might be quicker and load the cluster less to replace the broken node with a node that has the same hostname or IP and the old nodes data directory. If [email protected] dies due to a disk problem and your backups won't contain the data you would need to replace the node as you described. The best solution for replacing a dead node will vary based on circumstances, your use case, and your availability needs. Grant > Thanks! > > On Tue, May 3, 2011 at 6:24 AM, Grant Schofield <[email protected]> wrote: > > On May 2, 2011, at 7:58 PM, Jeff Pollard wrote: > >> I was reviewing the Riak Operations webinar, and it was mentioned that the >> preferred vm.args -name for EC2 environments should be "riak@hostname" >> because you don't have to "rename data or do anything weird" like you would >> if your nodes were named "[email protected]" (approximately 40:05 in the >> video). >> >> I was looking for some elaboration on this tip, namely: >> What is meant by "rename data or do anything weird" > > When you bring a cluster together using data copied from a different set of > nodes you need to re-ip the first node you plan to start but you also have to > change the ring manually on that node so when the subsequent nodes join > everything works properly. > >> Is "hostname" in riak@hostname a public DNS host that you configure in your >> DNS to map to the EC2 public hostname (ec2-50-18-...)? > > You can use DNS (public or private) or host file entries that reference the > private IP of the node if you choose to use hostname. > >> Does anyone have any best practices around vm.args -name in EC2 environments? > > We haven't outlined any best practices ourselves, but I tend to believe using > a hostname that you can change the IP for via DNS or a hosts file is a more > flexible way of approaching the problem. > > Grant > > >> Thanks! >> _______________________________________________ >> riak-users mailing list >> [email protected] >> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com > >
_______________________________________________ riak-users mailing list [email protected] http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
