Hi, Thank you for your answer.
Zookeeper version I'm doing tests on : 3.4.5-1392090, built on 09/30/2012 17:52 GMT Reconfig feature will be available in version: 3.5.0. Yes, you are right with rolling restart, but while I restart second service, whole cluster will be down. I'm trying to avoid downtime. -- cheers mc On Wed, Feb 19, 2014 at 12:31 PM, German Blanco < [email protected]> wrote: > So servers do check the sid in the server list. Sorry about that. Maybe you > are running trunk and I work mainly with 3.4.5, or maybe I was completely > wrong. > Why don't you just update all files first, and then do a rolling restart? > The first server that is restarted will not be able to join the quorum, but > hopefully when you restart the second it will form a quorum with the first > and then when you restart the third everything is back to normal. > Or try the reconfig feature? I haven't tried it myself, but it should be > more or less the same as updating the files. > > > On Wed, Feb 19, 2014 at 1:08 PM, Marcin Cabaj <[email protected] > >wrote: > > > Ok, I fixed it, > > > > the only thing I changed was zoo.conf > > server.1=zoo0:2888:3888 > > server.0=zoo1:2888:3888 > > and restarted zoo0. > > > > In the meantime I've created test ensemble, to test 'changing' server ID, > > my start configuration: > > > > zoo1.conf, zoo2.conf: zoo3.conf: > > server.41=localhost:2888:3888 > > server.42=localhost:2889:3889 > > server.43=localhost:2890:3890 > > > > zoo1/myid = 41 > > zoo2/myid = 42 > > zoo3/myid = 43 > > > > zoo3 is the LEADER > > > > At this moment, I can't change server id of followers:( > > What I do: > > 1) change zoo1/myid = 1 > > 2) change zoo1.conf: > > server.1=localhost:2888:3888 > > server.42=localhost:2889:3889 > > server.43=localhost:2890:3890 > > 3) restart zoo1 > > > > in logs I see that LEADER complains about invalid server id: 1 > > 2014-02-19 12:05:50,534 [myid:43] - WARN [localhost/127.0.0.1:3890 > > :QuorumCnxManager@344] - Invalid server id: 1 > > > > Question: How to change server ID one of the servers without shutting > down > > whole ensemble? > > > > -- > > cheers > > mc > > > > > > > > On Tue, Feb 18, 2014 at 3:54 PM, German Blanco < > > [email protected]> wrote: > > > > > Leave it as it is. Servers do no check if the sid from another server > is > > in > > > that list ... At least I believe they don't, and my experience so far > > > confirms it. And if they did it strictly, you wouldn't have reached > your > > > current state. > > > > > > On Tuesday, February 18, 2014, Marcin Cabaj <[email protected] > > > > > wrote: > > > > > > > Thanks, will try it tomorrow. > > > > One thing I'm wondering, if I set zoo0 id to eg 5, should I update > > > zoo.cfg > > > > on other servers? > > > > If so restart is needed as well right? It will crash my cluster. Or > > just > > > > leave zoo.cfg as is? > > > > > > > > -- > > > > cheers > > > > mc > > > > > > > > > > > > On Tue, Feb 18, 2014 at 1:41 PM, German Blanco < > > > > [email protected] <javascript:;>> wrote: > > > > > > > > > For this step: > > > > > "Set a different id in the myid file of server 0 (the one that is > > > down), > > > > > restart it, verify that it joins the quorum." any value that is not > > > used > > > > > should do, e.g. 3, 4, 5, 1231 ... > > > > > > > > > > > > > > > On Tue, Feb 18, 2014 at 12:04 PM, German Blanco < > > > > > [email protected] <javascript:;>> wrote: > > > > > > > > > > > Hello! > > > > > > Set a different id in the myid file of server 0 (the one that is > > > down), > > > > > > restart it, verify that it joins the quorum. > > > > > > If it joins the quorum, set the myid value in server 1 to one, > > > restart > > > > > it, > > > > > > verify that it joins the quorum. > > > > > > If it joins the quorum, update again the myid file of server 0, > > this > > > > time > > > > > > to the correct 0 value. Restart, verify that it all works. > > > > > > > > > > > > If any of the steps fails, stop and think it all over again. > > > > > > > > > > > > Good luck. > > > > > > > > > > > > > > > > > > On Tuesday, February 18, 2014, Marcin Cabaj < > > > [email protected]<javascript:;> > > > > > > > > > > > wrote: > > > > > > > > > > > >> Hi all, > > > > > >> > > > > > >> My ZooKeeper ensemble contains 3 servers, unfortunately somehow > > > > servers > > > > > >> ids > > > > > >> have been messed up. > > > > > >> > > > > > >> zoo.cfg on all servers: > > > > > >> server.0=zoo0:2888:3888 > > > > > >> server.1=zoo1:2888:3888 > > > > > >> server.2=zoo2:2888:3888 > > > > > >> > > > > > >> but: > > > > > >> on ZOO0: > > > > > >> [xxx@zoo0]$ cat /var/zookeeper/myid > > > > > >> 1 > > > > > >> [xxx@zoo0]$ echo conf | nc localhost 2181 > > > > > >> This ZooKeeper instance is not currently serving requests > > > > > >> > > > > > >> on ZOO1: > > > > > >> [xxx@zoo1] $ cat /var/zookeeper/myid > > > > > >> 0 > > > > > >> [xxx@zoo1:~]$ echo conf | nc localhost 2181 | grep serverId > > > > > >> > > > > > >> serverId=0 > > > > > >> > > > > > >> on ZOO2: > > > > > >> [xxx@zoo2:~]$ cat /var/zookeeper/myid > > > > > >> 2 > > > > > >> [xxx@zoo2:~]$ echo conf | nc localhost 2181 | grep serverId > > > > > >> serverId=2 > > > > > >> > > > > > >> How to fix this without shutting down whole ensemble? > > > > > >> Currently I have connections established to ZOO1 and ZOO2. > > > > > >> ZOO0 is listening on 2181 but doesn't accept connections. > > > > > >> ZOO2 is the leader. > > > > > >> > > > > > >> Zookeeper version: 3.3.5-cdh3u5--1, built on 10/06/2012 01:58 > GMT > > > > > >> > > > > > >> Cheers > > > > > >> > > > > > > > > > > > > > > > > > > > > >
