Stig Rohde Døssing created KAFKA-3635:
-----------------------------------------
Summary: Broker leadership state can become inconsistent between
Zookeeper/MetadataCache and the ReplicaManager's Partition
Key: KAFKA-3635
URL: https://issues.apache.org/jira/browse/KAFKA-3635
Project: Kafka
Issue Type: Bug
Affects Versions: 0.9.0.1
Reporter: Stig Rohde Døssing
Priority: Critical
On a 3 broker cluster, broker 2 and 3 managed to get into a state where neither
was fully the leader. The state in Zookeeper and the metadata cache (accessed
via TopicMetadataRequest) states that broker 3 is the leader, but trying to
produce/consume messages from that broker throws
NotLeaderForPartitionException. Meanwhile, consuming from broker 2 is possible,
but a metadata request to it reports that broker 3 is the leader.
Since the leader lookup when handling fetch requests is via Partition and via
MetadataCache for metadata requests, I'm guessing the two can get into an
inconsistent state.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)