Actually, it was not just a zookeeper process down, but whole machine dies due to kernel panic, and ping to that server failed accordingly.All servers were connected to each other and in replication mode, of course. -----Original Message----- From: "Irek Khasyanov"<[email protected]> To: "user"<[email protected]>; "이승진"<[email protected]>; Cc: Sent: 2014-07-04 (금) 14:51:00 Subject: Re: storm crashes when one of the zookeeper server dies Are you sure all zookeeper servers are in replication mode and can connect each other? Yesterday we added zookeeper cluster and checked what happens with storm when one zookeeper fails. Everything was good, nothing happens with topology.
On 4 July 2014 08:26, 이승진 <[email protected]> wrote: in storm.yaml, we listed 3 zookeeper servers storm.zookeeper.servers: -"host1" -"host2" -"host3" and host1 dies unexpectedly today morning, since then, not only I cannot connect to storm UI (TTransport exception) but also can't execute any of storm command. I was quite worried when this happened, because if it's what storm is supposed to be, one of the zookeeper server can be a SPOF. seems like it should be fixed ASAP Sincerly, -- With best regards, Irek Khasyanov.
