Chia-Ping Tsai created KAFKA-20036:
--------------------------------------

             Summary: Handle LogCleaner segment overflow caused by compression 
level changes
                 Key: KAFKA-20036
                 URL: https://issues.apache.org/jira/browse/KAFKA-20036
             Project: Kafka
          Issue Type: Bug
            Reporter: Chia-Ping Tsai
            Assignee: Chia-Ping Tsai


from: [https://github.com/apache/kafka/pull/15516#discussion_r2649600413]

The {{LogCleaner}} can fail with {{java.lang.IllegalArgumentException}} when 
the compacted data exceeds the segment size limit.

This issue occurs because the cleaner rebuilds batches using the default 
compression level. If the data was originally ingested with a higher 
compression level, the re-compressed batch size increases, causing the 
cleaner's size estimation to be inaccurate and leading to an overflow.

To address this without major architectural changes (like splitting segments 
during cleaning), we should introduce a safety margin or adaptive logic for 
partitions that encounter this overflow.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to