Michael Coon created KAFKA-3534:
-----------------------------------
Summary: Deserialize on demand when default time extractor used
Key: KAFKA-3534
URL: https://issues.apache.org/jira/browse/KAFKA-3534
Project: Kafka
Issue Type: Improvement
Components: streams
Affects Versions: 0.9.0.1
Reporter: Michael Coon
Assignee: Guozhang Wang
Priority: Minor
When records are added to the RecordQueue, they are deserialized at that time
in order to extract the timestamp. But for some data flows where large messages
are consumed (particularly compressed messages), this can result in large
spikes in memory as all messages must be deserialized prior to processing (and
getting out of memory). An optimization might be to only require
deserialization at this stage if a non-default timestamp extractor is being
used.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)