Stéphane Loeuillet created ZOOKEEPER-4783:
---------------------------------------------

             Summary: leader crash because of zxid 32b rollover but no other 
server takes the lead
                 Key: ZOOKEEPER-4783
                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-4783
             Project: ZooKeeper
          Issue Type: Bug
    Affects Versions: 3.8.3
         Environment: Linux amd64 Ubuntu 20.04.5

Java OpenJDK17U-jre_x64_linux_hotspot_17.0.8.1_1.tar.gz
            Reporter: Stéphane Loeuillet
         Attachments: zookeeper_crash.log

Got a 5 node cluster running on baremetal servers (with NVMe) used by a 
ClickHouse cluster on a separate cluster.

This morning, a crash on the leader did let my clusters unusable as while the 
leader crashed, none of the 4 followers did take the lead

 

zookeeper leader was zookeeper08

05/06/07/09 were the followers

 

Only a restart of zookeeper05 process did unfreeze the whole cluster



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to