Luke Chen created KAFKA-18930:
---------------------------------

             Summary: KRaft MigrationEvent won't retry when failing to write 
data to ZK 
                 Key: KAFKA-18930
                 URL: https://issues.apache.org/jira/browse/KAFKA-18930
             Project: Kafka
          Issue Type: Improvement
    Affects Versions: 3.9.0
            Reporter: Luke Chen


During a ZK to KRaft migration, the cluster runs in dual-write mode: metadata 
is written to KRaft first and then to ZK asynchronously. If the ZK write fails 
with an exception, the KRaft MigrationEvent does not retry it. That causes 
metadata inconsistency between KRaft and ZK.

In addition, during a clean shutdown of the KRaft controller, we should keep 
retrying the failed ZK write until a forced shutdown, to make sure the 
metadata is consistent.
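
A minimal sketch of the proposed behavior, not the actual Kafka code: retry 
the failed ZK write with capped exponential backoff, and stop only when the 
write succeeds or a forced shutdown is requested. The names 
ZkWriteRetrySketch, ZkWrite, writeWithRetry, the forceShutdown flag, and the 
backoff values are all hypothetical and chosen for illustration only.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicInteger;

final class ZkWriteRetrySketch {

    /** Hypothetical stand-in for one ZK write issued by a migration event. */
    @FunctionalInterface
    interface ZkWrite {
        void run() throws Exception;
    }

    /**
     * Retries the write until it succeeds or forceShutdown is set.
     * Returns true if the write eventually succeeded, false if it was
     * abandoned because of a forced shutdown or a thread interrupt.
     */
    static boolean writeWithRetry(ZkWrite write, AtomicBoolean forceShutdown,
                                  long initialBackoffMs, long maxBackoffMs) {
        long backoffMs = initialBackoffMs;
        while (!forceShutdown.get()) {
            try {
                write.run();
                return true; // ZK now matches KRaft for this write
            } catch (Exception e) {
                // Assumption: every failure is treated as retriable here.
                try {
                    Thread.sleep(backoffMs);
                } catch (InterruptedException ie) {
                    Thread.currentThread().interrupt();
                    return false;
                }
                // Double the delay, capped at maxBackoffMs.
                backoffMs = Math.min(backoffMs * 2, maxBackoffMs);
            }
        }
        return false;
    }

    public static void main(String[] args) {
        AtomicInteger attempts = new AtomicInteger();
        AtomicBoolean forceShutdown = new AtomicBoolean(false);
        // Simulated ZK write that fails twice, then succeeds.
        boolean ok = writeWithRetry(() -> {
            if (attempts.incrementAndGet() < 3) {
                throw new IllegalStateException("simulated ZK failure");
            }
        }, forceShutdown, 1L, 4L);
        System.out.println("succeeded=" + ok + " attempts=" + attempts.get());
    }
}
```

With a loop like this, a clean shutdown would leave forceShutdown unset so 
that pending ZK writes keep retrying, and only a forced shutdown would set it 
and abandon them.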

The impact is that when rolling back to ZK mode during migration, the metadata 
in ZK is out of date.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
