This patch fixes the inconsistent handling of out of range errors in the 
replica fetcher. Previously we would raise a fatal error if the follower's 
offset is ahead of the leader's and unclean leader election is not enabled. The 
behavior was inconsistent depending on the message format. With 
KIP-101/KIP-279, upon becoming a follower, the replica would use leader epoch 
information to reconcile the end of the log with the leader and simply 
truncate. Additionally, with the old format, the check is not really 
bulletproof for detecting data loss since the unclean leader's end offset might 
have already caught up to the follower's offset at the time of its initial 
fetch or when it queries for the current log end offset.

With this patch, we simply skip the unclean leader election check and allow the 
needed truncation to occur. When the truncation offset is below the high 
watermark, a warning will be logged. This makes the behavior consistent for all 
message formats and removes a scenario in which an error on one partition can 
bring the broker down.

### Committer Checklist (excluded from commit message)
- [ ] Verify design and implementation 
- [ ] Verify test coverage and CI build status
- [ ] Verify documentation (including upgrade notes)


[ Full content available at: https://github.com/apache/kafka/pull/5654 ]
This message was relayed via gitbox.apache.org for [email protected]

Reply via email to