[ https://issues.apache.org/jira/browse/KAFKA-6128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16220293#comment-16220293 ]
Alastair Munro edited comment on KAFKA-6128 at 10/26/17 10:45 AM: ------------------------------------------------------------------ It does it with 0.11.0.0. It would seem the broker id's are not being replicated in zookeeper. So kafka connects to zookeeper:2181 which is a load balancer. Then when it terminates, it removes the broker-id on the zookeeper it connected to, but the change is not replicated to the other two nodes. As zookeeper may scale up and down, we use zookeeper:2181 rather than a list of hosts. Testing replication in zookeeper (eg creating /test using zkCli.sh), replication works fine. So why doesn't the broker changes replicate? I scaled this down to have only brokers 0 and 1, but the change is only made to the node the departing kafka connected to: {code} oc rsh zoo-0 bin/zkCli.sh ls /brokers/ids [0, 1] oc rsh zoo-1 bin/zkCli.sh ls /brokers/ids [0, 1, 2] {code} was (Author: amunro): It does it with 0.11.0.0. It would seem the broker id's are not being replicated in zookeeper. So kafka connects to zookeeper:2181 which is a load balancer. Then when it terminates, it removes the broker-id on the zookeeper it connected to, but the change is not replicated to the other two nodes. As zookeeper may scale up and down, we use zookeeper:2181 rather than a list of hosts. Testing replication in zookeeper (eg creating /test using zkCli.sh), replication works fine. So why doesn't the broker changes replicate? I scaled this down to have only brokers 0 and 1, but some of the zoo nodes don't see the change: {code} oc rsh zoo-0 bin/zkCli.sh ls /brokers/ids [0, 1] oc rsh zoo-1 bin/zkCli.sh ls /brokers/ids [0, 1, 2] {code} > Shutdown script does not do a clean shutdown > -------------------------------------------- > > Key: KAFKA-6128 > URL: https://issues.apache.org/jira/browse/KAFKA-6128 > Project: Kafka > Issue Type: Bug > Affects Versions: 0.11.0.1 > Reporter: Alastair Munro > Priority: Minor > > Shutdown script (sending term signal) does not do a clean shutdown. > We are running kafka in kubernetes/openshift 0.11.0.0. The statefulset kafka > runs the shutdown script prior to stopping the pod kafka is running on: > {code} > lifecycle: > preStop: > exec: > command: > - ./bin/kafka-server-stop.sh > {code} > This worked perfectly in 0.11.0.0 but doesn't in 0.11.0.1. Also we see the > same behaviour if we send a TERM signal to the kafka process (same as the > shutdown script). -- This message was sent by Atlassian JIRA (v6.4.14#64029)