xvrl opened a new pull request #10730:
URL: https://github.com/apache/druid/pull/10730


   Today Kafka message support in streaming indexing tasks is limited to
   message values, and does not provide a way to expose Kafka headers,
   timestamps, or keys, which may be of interest to more specialized
   Druid input formats. For instance, Kafka headers may be used to indicate
   payload format/encoding or additional metadata, and timestamps are often
   omitted from values in Kafka Streams applications, since they are
   already included in the record.
   
   This change introduces KafkaRecordEntity as an InputEntity, which gives
   input formats full access to the underlying Kafka record, including its
   headers, key, and timestamp. It also opens access to low-level
   information such as topic, partition, and offset if needed.
   
   KafkaRecordEntity is a subclass of ByteEntity for backwards compatibility
   with existing input formats, and to avoid introducing unnecessary
   complexity for Kinesis indexing tasks.
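   
   A minimal sketch of the subclassing idea, not the code in this PR. Every
   type below (ByteEntitySketch, RecordSketch, KafkaRecordEntitySketch) is a
   hypothetical stand-in so the example compiles without Druid or the Kafka
   client on the classpath. Headers are simplified to a map, although real
   Kafka headers may repeat keys.

```java
import java.nio.charset.StandardCharsets;
import java.util.Collections;
import java.util.Map;

// Hypothetical stand-in for Druid's ByteEntity: an entity that wraps raw bytes.
class ByteEntitySketch {
    private final byte[] bytes;

    ByteEntitySketch(byte[] bytes) {
        this.bytes = bytes;
    }

    byte[] getBytes() {
        return bytes;
    }
}

// Hypothetical stand-in for a Kafka consumer record (simplified).
final class RecordSketch {
    final String topic;
    final int partition;
    final long offset;
    final long timestamp;
    final byte[] key;
    final byte[] value;
    final Map<String, byte[]> headers;

    RecordSketch(String topic, int partition, long offset, long timestamp,
                 byte[] key, byte[] value, Map<String, byte[]> headers) {
        this.topic = topic;
        this.partition = partition;
        this.offset = offset;
        this.timestamp = timestamp;
        this.key = key;
        this.value = value;
        this.headers = headers;
    }
}

// The entity exposes the message value through the parent class, so formats
// that only know about byte entities keep working, and it also keeps the
// whole record for formats that want headers, key, or timestamp.
class KafkaRecordEntitySketch extends ByteEntitySketch {
    private final RecordSketch record;

    KafkaRecordEntitySketch(RecordSketch record) {
        super(record.value);
        this.record = record;
    }

    RecordSketch getRecord() {
        return record;
    }
}

class KafkaRecordEntityDemo {
    // A toy "input format": always reads the value bytes, and appends record
    // metadata only when the entity happens to carry a full record.
    static String describe(ByteEntitySketch entity) {
        String payload = new String(entity.getBytes(), StandardCharsets.UTF_8);
        if (entity instanceof KafkaRecordEntitySketch) {
            RecordSketch r = ((KafkaRecordEntitySketch) entity).getRecord();
            return payload + " @" + r.timestamp
                + " from " + r.topic + "-" + r.partition + ":" + r.offset;
        }
        return payload;
    }

    public static void main(String[] args) {
        byte[] value = "{\"x\":1}".getBytes(StandardCharsets.UTF_8);
        RecordSketch record = new RecordSketch(
            "events", 0, 42L, 1609459200000L, null, value,
            Collections.<String, byte[]>emptyMap());
        System.out.println(describe(new KafkaRecordEntitySketch(record)));
        System.out.println(describe(new ByteEntitySketch(value)));
    }
}
```

   Extending the byte-based entity means a value-only format needs no
   changes, while a record-aware format can check the entity type and read
   the extra fields.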


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


