[
https://issues.apache.org/jira/browse/KAFKA-1499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14162336#comment-14162336
]
Joel Koshy commented on KAFKA-1499:
-----------------------------------
Talked to Jay offline and here is a summary of what we discussed: the main
motivations for this feature (currently) are:
# Log compaction
# Ensuring messages on the broker are compressed if a bunch of producers happen
to send messages uncompressed - say if all producers in an organization
happened to pick up a bad config over time
# Ensuring messages on the broker are compressed with a specific compression
type - perhaps if downstream consumers want only that compression type
For the first use-case, we can potentially get around it as described above by
picking any compression type - i.e., if we are writing out a batch of messages
that contains various compression types we can just pick one of those types.
This is not as neat as having an explicit target compression type but it seems
reasonable.
The second and third use-cases are likely only marginally useful.
So we have a couple of options:
* Do nothing - given that we have (what seems to be) a reasonable approach for
dealing log compaction. i.e., we can table this and revisit if we have a very
compelling use case for it.
* Add the compression.type config as a server config and topic-override config.
** In order to address the concern of forgetting to set this (or
misconfiguration) there are two approaches:
*** Make it an optional configuration as mentioned further above
*** Have it default to compression.type "producer" - which means use whatever
compression type the producer used.
> Broker-side compression configuration
> -------------------------------------
>
> Key: KAFKA-1499
> URL: https://issues.apache.org/jira/browse/KAFKA-1499
> Project: Kafka
> Issue Type: New Feature
> Reporter: Joel Koshy
> Assignee: Manikumar Reddy
> Labels: newbie++
> Fix For: 0.8.2
>
> Attachments: KAFKA-1499.patch, KAFKA-1499.patch,
> KAFKA-1499_2014-08-15_14:20:27.patch, KAFKA-1499_2014-08-21_21:44:27.patch,
> KAFKA-1499_2014-09-21_15:57:23.patch, KAFKA-1499_2014-09-23_14:45:38.patch,
> KAFKA-1499_2014-09-24_14:20:33.patch, KAFKA-1499_2014-09-24_14:24:54.patch,
> KAFKA-1499_2014-09-25_11:05:57.patch
>
> Original Estimate: 72h
> Remaining Estimate: 72h
>
> A given topic can have messages in mixed compression codecs. i.e., it can
> also have a mix of uncompressed/compressed messages.
> It will be useful to support a broker-side configuration to recompress
> messages to a specific compression codec. i.e., all messages (for all
> topics) on the broker will be compressed to this codec. We could have
> per-topic overrides as well.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)