Luke Chen created KAFKA-18930:
---------------------------------
Summary: KRaft MigrationEvent won't retry when failing to write
data to ZK
Key: KAFKA-18930
URL: https://issues.apache.org/jira/browse/KAFKA-18930
Project: Kafka
Issue Type: Improvement
Affects Versions: 3.9.0
Reporter: Luke Chen
When running ZK migrating to KRaft, there will be a dual-write mode. In that
mode, metadata will write to KRaft, then write to ZK asynchronously. When
there's some exception, KRaft MigrationEvent won't retry when failing to write
data to ZK. That causes metadata inconsistency between KRaft and ZK.
Besides, when doing KRaft controller clean shutdown, we should keep retrying
the failing ZK writing until force shutdown, to make sure the metadata is
consistent.
The impact is when rolling back to ZK mode during migration, the metadata in ZK
is out of date
--
This message was sent by Atlassian Jira
(v8.20.10#820010)