Hi Dongjin, hi Kafka devs,

Thanks a lot for opening this KIP, and hats off for the amount of
benchmarking and investigation you’ve done! It’s great to see a follow-up
to KIP-390 that digs deeper into these compression-level options with solid
data to back it up.

One thing I wanted to clarify: what specific compression options are we
targeting here? From what I saw in the related PR [1], it seems we’re
mostly exposing block and window sizes. But many codecs expose more than
that:


   - *GZIP* has options like strategy, window size, and buffer size
   - *LZ4* supports block size (64KB–4MB), block mode (independent vs
     linked), checksums, and dictionaries
   - *Snappy*, as far as I know, doesn’t expose much for tuning
   - *ZSTD* has a huge set: threading, window size, block size, dictionaries,
     long-distance matching, checksums, etc. It’s a beast in terms of
     configurability 😄
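
To make the question a bit more concrete, here is a rough sketch of the kind
of knobs I mean, using GZIP as the example. It uses Python's standard zlib
module purely for illustration and is not Kafka code; the helper names
(gzip_compress, gzip_decompress) and the sample payload are made up for this
sketch.

```python
import zlib


def gzip_compress(data: bytes, level: int = 6, window_bits: int = 15,
                  mem_level: int = 8,
                  strategy: int = zlib.Z_DEFAULT_STRATEGY) -> bytes:
    """Compress with gzip framing, exposing zlib's tuning knobs.

    Hypothetical helper for illustration only.
    window_bits: log2 of the history window (9..15 when gzip framing is used).
    mem_level:   internal state size (1..9); trades memory for speed/ratio.
    strategy:    e.g. Z_DEFAULT_STRATEGY, Z_FILTERED, Z_HUFFMAN_ONLY.
    """
    # Adding 16 to wbits tells zlib to write a gzip header and trailer.
    comp = zlib.compressobj(level, zlib.DEFLATED, 16 + window_bits,
                            mem_level, strategy)
    return comp.compress(data) + comp.flush()


def gzip_decompress(blob: bytes) -> bytes:
    # 16 + MAX_WBITS accepts gzip framing for any window up to 32 KiB.
    return zlib.decompress(blob, 16 + zlib.MAX_WBITS)


# Made-up, highly repetitive payload standing in for a record batch.
payload = b'{"user":"alice","event":"click"}\n' * 2000

default = gzip_compress(payload)
small_window = gzip_compress(payload, window_bits=9)
huffman_only = gzip_compress(payload, strategy=zlib.Z_HUFFMAN_ONLY)

for name, blob in [("default", default),
                   ("window_bits=9", small_window),
                   ("Z_HUFFMAN_ONLY", huffman_only)]:
    print(f"{name}: {len(blob)} bytes")
```

All three outputs decompress to the same payload; only the size and cost
trade-off changes. That is why I'm wondering which of these knobs we want to
surface as configs.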

So I’m curious — is the intent of this KIP to eventually support a broader
set of codec-specific settings, or are we intentionally scoping it down to
just block/window size for now?

Also, just to check — are you still interested in implementing this KIP
(i.e., KIP-780)? If not, would you be open to me taking it over or helping
move it forward? Of course, only if that works for you — I’d be happy to
coordinate if there’s still interest in pursuing this.

Looking forward to your thoughts!

Best,

Maros Orsak

[1] - https://github.com/apache/kafka/pull/11388/files
