[
https://issues.apache.org/jira/browse/KAFKA-1499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14154336#comment-14154336
]
Jay Kreps commented on KAFKA-1499:
----------------------------------
Yeah I totally agree.
I agree that some heuristic that worked batch-by-batch might be okay, I hadn't
thought of that. Actually though I think the main motivation for this feature
was to fix the compaction issue, so if that is an okay fix just doing that
would be an alternative.
I also agree that NoCompressionCodec should be the default and unless people
know about the change they will surely be confused by this switch. However I
claim this is a temporary confusion based on the fact that previously Kafka
compression worked one way and now it will work a new way. Plus they will in
any case have this confusion if they turn on the feature. For any new user the
configuration docs will all be updated and in the process of learning how to
turn on compression they will learn how it works. I think we could help this
with good release notes (when doing an upgrade people always read that to
ensure it is in-place compatible).
I guess in the end what I am arguing is that we should make a choice. Either a
single compression codec per topic is better and it should work that way or
else having the producer specify compression is better and it should work that
way. Giving the user the choice seems nice but it actually just adds complexity
since now we will always have to document and explain both and tell people
about the configuration knob to choose and then advise them on how to best make
the choice (and then debug when they get lost in all this). If we think the
right choice is very situation specific (in situation x, chose
broker.compression.enabled=true, in situation y chose false) then okay maybe we
need a config, but then let's figure out what the situations you want one
versus the other. If it isn't situation specific we should just choose one and
implement and document that.
> Broker-side compression configuration
> -------------------------------------
>
> Key: KAFKA-1499
> URL: https://issues.apache.org/jira/browse/KAFKA-1499
> Project: Kafka
> Issue Type: New Feature
> Reporter: Joel Koshy
> Assignee: Manikumar Reddy
> Labels: newbie++
> Fix For: 0.8.2
>
> Attachments: KAFKA-1499.patch, KAFKA-1499.patch,
> KAFKA-1499_2014-08-15_14:20:27.patch, KAFKA-1499_2014-08-21_21:44:27.patch,
> KAFKA-1499_2014-09-21_15:57:23.patch, KAFKA-1499_2014-09-23_14:45:38.patch,
> KAFKA-1499_2014-09-24_14:20:33.patch, KAFKA-1499_2014-09-24_14:24:54.patch,
> KAFKA-1499_2014-09-25_11:05:57.patch
>
> Original Estimate: 72h
> Remaining Estimate: 72h
>
> A given topic can have messages in mixed compression codecs. i.e., it can
> also have a mix of uncompressed/compressed messages.
> It will be useful to support a broker-side configuration to recompress
> messages to a specific compression codec. i.e., all messages (for all
> topics) on the broker will be compressed to this codec. We could have
> per-topic overrides as well.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)