On Wed, Mar 3, 2010 at 9:27 AM, Eric Evans <eev...@rackspace.com> wrote:
> On Wed, 2010-03-03 at 16:49 +0000, Christopher Brind wrote:
>> So is the current general practice to connect to a known node, e.g. by
>> ip address?
>
> There are so many ways you could tackle this but...
>
> If you're talking about provisioning/startup of new nodes, just use the
> IPs of 2-4 nodes in the seeds section of configs.
>
> If you're talking about clients, then round-robin DNS is one option.
> Load-balancers are another. Either could be used with a subset of
> higher-capacity/higher-availability nodes, or for the entire cluster.
>
>> If so, what happens if that node is down? Is the entire cluster
>> effectively broken at that point?
>
> You don't use just one node, see above.
>
>> Or do clients simply maintain a list of nodes and just connect to the
>> first available in the list?
>
> It's possible to obtain a list of nodes over Thrift. So, yet another
> option would be to use a short-list of well-known nodes (discovered via
> round-robin DNS for example), to obtain a current node list and
> distribute among them.

This is exactly what we do.

-ryan
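
For anyone reading along, here is a rough sketch of the client-side approach described above: try a short list of well-known nodes until one answers, take the node list it returns, and spread requests across that list. The thread does not name the Thrift call involved, so `fetch_node_list` is a hypothetical placeholder you would back with whatever your client library provides, and the addresses and `fake_fetch` are made up for illustration.

```python
import itertools
from typing import Callable, Iterator, List, Sequence


def discover_nodes(seeds: Sequence[str],
                   fetch_node_list: Callable[[str], List[str]]) -> List[str]:
    """Ask each well-known seed in turn; return the first node list obtained."""
    for seed in seeds:
        try:
            nodes = fetch_node_list(seed)  # hypothetical: wraps the Thrift call
        except OSError:
            continue  # this seed is down or unreachable, try the next one
        if nodes:
            return sorted(set(nodes))
    raise RuntimeError("no seed node returned a node list")


def round_robin(nodes: Sequence[str]) -> Iterator[str]:
    """Cycle through the discovered nodes so requests are spread evenly."""
    return itertools.cycle(nodes)


def fake_fetch(seed: str) -> List[str]:
    """Stand-in for a real lookup: the first seed is 'down', others answer."""
    if seed == "10.0.0.1":
        raise ConnectionRefusedError("seed down")
    return ["10.0.0.2", "10.0.0.3", "10.0.0.4"]


if __name__ == "__main__":
    nodes = discover_nodes(["10.0.0.1", "10.0.0.2"], fake_fetch)
    picker = round_robin(nodes)
    for _ in range(4):
        print("would connect to", next(picker))
```

A real client would probably also refresh the list periodically and skip nodes that refuse connections; that is left out to keep the sketch short.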