[
https://issues.apache.org/jira/browse/FLINK-36611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ruan Hang updated FLINK-36611:
------------------------------
Fix Version/s: cdc-3.4.0
(was: cdc-3.3.0)
> Add schema info to output of Kafka sink
> -----------------------------------------
>
> Key: FLINK-36611
> URL: https://issues.apache.org/jira/browse/FLINK-36611
> Project: Flink
> Issue Type: New Feature
> Components: Flink CDC
> Affects Versions: cdc-3.3.0
> Reporter: Yanquan Lv
> Priority: Major
> Labels: pull-request-available
> Fix For: cdc-3.4.0
>
>
> Currently, the output of Kafka sink in debezium format looks like this:
> {code:java}
> {
> "before": {
> "id": 4,
> "name": "John",
> "address": "New York",
> "phone_number": "2222",
> "age": 12
> },
> "after": {
> "id": 4,
> "name": "John",
> "address": "New York",
> "phone_number": "1234",
> "age": 12
> },
> "op": "u",
> "source": {
> "db": null,
> "table": "customers"
> }
> } {code}
> It contains record data with full before/after and db info, but schema info
> wasn't included.
> However, In some scenarios, we need this information to determine the type of
> data. For example, Paimon's Kafka CDC source requires this type information,
> otherwise all types are considered String, refer to
> [https://paimon.apache.org/docs/0.9/flink/cdc-ingestion/kafka-cdc/#supported-formats.]
> Considering that this will increase the data load, I suggest adding a
> parameter to configure whether to enable it.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)