David Lao created KAFKA-1106:
--------------------------------
Summary: HighwaterMarkCheckpoint failure puting broker into a bad
state
Key: KAFKA-1106
URL: https://issues.apache.org/jira/browse/KAFKA-1106
Project: Kafka
Issue Type: Bug
Components: core
Affects Versions: 0.8
Reporter: David Lao
Attachments: kafka.log
I'm encountering a case where broker get stuck due to HighwaterMarkCheckpoint
failing to recover from reading what appear to be corrupted isr entries. Once
in this state, leader election can never succeed and hence stalling the entire
cluster.
Please see the detailed stack trace from the attached log. Perhaps failing
fast when HighwaterMarkCheckpoint fails to read would force the broker to
restart and recover.
--
This message was sent by Atlassian JIRA
(v6.1#6144)