[ https://issues.apache.org/jira/browse/KAFKA-12729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
luws updated KAFKA-12729:
-------------------------
    Description: 
Kafka version: 2.12-2.1.1

After a brief ZooKeeper disconnection, broker6 became the controller leader, but the other brokers could not connect to broker6 until we restarted it, after which the cluster recovered.

Bandwidth was normal.

!image-2021-04-29-11-33-24-303.png!

!image-2021-04-29-11-33-53-048.png!

!image-2021-04-29-11-34-18-654.png!

!image-2021-04-29-11-35-26-994.png!

!image-2021-04-29-11-35-41-393.png!

!image-2021-04-29-11-35-52-191.png!

{code:java}
// code placeholder
Connection to 6 was disconnected before the response was read
    at org.apache.kafka.clients.NetworkClientUtils.sendAndReceive(NetworkClientUtils.java:100)
    at kafka.server.ReplicaFetcherBlockingSend.sendRequest(ReplicaFetcherBlockingSend.scala:99)
    at kafka.server.ReplicaFetcherThread.fetchFromLeader(ReplicaFetcherThread.scala:192)
    at kafka.server.AbstractFetcherThread.processFetchRequest(AbstractFetcherThread.scala:274)
    at kafka.server.AbstractFetcherThread.$anonfun$maybeFetch$3(AbstractFetcherThread.scala:132)
    at kafka.server.AbstractFetcherThread.$anonfun$maybeFetch$3$adapted(AbstractFetcherThread.scala:131)
    at scala.Option.foreach(Option.scala:257)
    at kafka.server.AbstractFetcherThread.maybeFetch(AbstractFetcherThread.scala:131)
    at kafka.server.AbstractFetcherThread.doWork(AbstractFetcherThread.scala:113)
    at kafka.utils.ShutdownableThread.run(ShutdownableThread.scala:82){code}

  was:
Kafka version: 2.12-2.1.1

After a brief ZooKeeper disconnection, broker6 became the controller leader, but the other brokers could not connect to broker6 until we restarted it, after which the cluster recovered.

Bandwidth was normal.

!image-2021-04-29-11-24-22-704.png!

!image-2021-04-29-11-27-12-924.png!

!image-2021-04-29-11-27-35-679.png!

!image-2021-04-29-11-25-41-208.png!

!image-2021-04-29-11-26-34-894.png!

{code:java}
// code placeholder
Connection to 6 was disconnected before the response was read
    at org.apache.kafka.clients.NetworkClientUtils.sendAndReceive(NetworkClientUtils.java:100)
    at kafka.server.ReplicaFetcherBlockingSend.sendRequest(ReplicaFetcherBlockingSend.scala:99)
    at kafka.server.ReplicaFetcherThread.fetchFromLeader(ReplicaFetcherThread.scala:192)
    at kafka.server.AbstractFetcherThread.processFetchRequest(AbstractFetcherThread.scala:274)
    at kafka.server.AbstractFetcherThread.$anonfun$maybeFetch$3(AbstractFetcherThread.scala:132)
    at kafka.server.AbstractFetcherThread.$anonfun$maybeFetch$3$adapted(AbstractFetcherThread.scala:131)
    at scala.Option.foreach(Option.scala:257)
    at kafka.server.AbstractFetcherThread.maybeFetch(AbstractFetcherThread.scala:131)
    at kafka.server.AbstractFetcherThread.doWork(AbstractFetcherThread.scala:113)
    at kafka.utils.ShutdownableThread.run(ShutdownableThread.scala:82){code}


> controller leader keep disconnected and block the cluster
> ---------------------------------------------------------
>
>                 Key: KAFKA-12729
>                 URL: https://issues.apache.org/jira/browse/KAFKA-12729
>             Project: Kafka
>          Issue Type: Bug
>          Components: controller, KafkaConnect
>    Affects Versions: 2.1.1
>            Reporter: luws
>            Priority: Critical
>         Attachments: image-2021-04-29-11-33-24-303.png, 
> image-2021-04-29-11-33-53-048.png, image-2021-04-29-11-34-18-654.png, 
> image-2021-04-29-11-35-26-994.png, image-2021-04-29-11-35-41-393.png, 
> image-2021-04-29-11-35-52-191.png
>
>
> Kafka version: 2.12-2.1.1
> After a brief ZooKeeper disconnection, broker6 became the controller leader,
> but the other brokers could not connect to broker6 until we restarted it,
> after which the cluster recovered.
>  
> Bandwidth was normal.
> !image-2021-04-29-11-33-24-303.png!
> !image-2021-04-29-11-33-53-048.png!
> !image-2021-04-29-11-34-18-654.png!
> !image-2021-04-29-11-35-26-994.png!
> !image-2021-04-29-11-35-41-393.png!
> !image-2021-04-29-11-35-52-191.png!
>
> {code:java}
> // code placeholder
> Connection to 6 was disconnected before the response was read
>     at org.apache.kafka.clients.NetworkClientUtils.sendAndReceive(NetworkClientUtils.java:100)
>     at kafka.server.ReplicaFetcherBlockingSend.sendRequest(ReplicaFetcherBlockingSend.scala:99)
>     at kafka.server.ReplicaFetcherThread.fetchFromLeader(ReplicaFetcherThread.scala:192)
>     at kafka.server.AbstractFetcherThread.processFetchRequest(AbstractFetcherThread.scala:274)
>     at kafka.server.AbstractFetcherThread.$anonfun$maybeFetch$3(AbstractFetcherThread.scala:132)
>     at kafka.server.AbstractFetcherThread.$anonfun$maybeFetch$3$adapted(AbstractFetcherThread.scala:131)
>     at scala.Option.foreach(Option.scala:257)
>     at kafka.server.AbstractFetcherThread.maybeFetch(AbstractFetcherThread.scala:131)
>     at kafka.server.AbstractFetcherThread.doWork(AbstractFetcherThread.scala:113)
>     at kafka.utils.ShutdownableThread.run(ShutdownableThread.scala:82){code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)