[
https://issues.apache.org/jira/browse/FLINK-27343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
pengyusong updated FLINK-27343:
-------------------------------
Description:
* situation one
When I use the Flink SQL Kafka connector to re-consume a topic that already
holds many messages, the JDBC sink runs with its default parameters.
The Kafka topic is a compacted topic whose contents are the CDC events of a
MySQL table.
Some records in one batch share the same key. They are buffered within that
batch and finally written to Postgres out of order: later records in the
buffered batch are executed first.
As a result, older Kafka messages are applied after newer ones, so the
result is inconsistent with the Kafka message order.
* situation two
If I set
h5. sink.buffer-flush.interval = 0
h5. sink.buffer-flush.max-rows = 1
the result is also inconsistent with the Kafka message order.
So I suspect that the execution order within the JDBC buffer is
non-deterministic, which leads to unordered results in the JDBC sink.
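The two situations can be compared with a toy model. The sketch below is not the Flink JDBC connector's code: `apply_in_batches` and the sample events are hypothetical, and the model assumes a buffer that replays events in arrival order. Under that assumption, batching alone does not change the final table, while input that is already reordered gives a different result even with a batch size of 1.

```python
def apply_in_batches(events, max_rows):
    """Buffer (key, value) events in arrival order and flush every `max_rows` events.

    Hypothetical model of an order-preserving sink buffer; the returned dict
    stands in for the final state of the target table.
    """
    table, buffer = {}, []
    for key, value in events:
        buffer.append((key, value))
        if len(buffer) >= max_rows:
            for k, v in buffer:  # replay the batch in arrival order
                table[k] = v
            buffer.clear()
    for k, v in buffer:  # final flush of whatever is left
        table[k] = v
    return table


# Several updates for the same keys, in source order.
events = [("a", 1), ("b", 1), ("a", 2), ("a", 3), ("b", 2)]

row_at_a_time = apply_in_batches(events, max_rows=1)
batched = apply_in_batches(events, max_rows=100)
# Same events, but already reordered before they reach the buffer.
reordered_input = apply_in_batches(list(reversed(events)), max_rows=1)

print(row_at_a_time)    # {'a': 3, 'b': 2}
print(batched)          # {'a': 3, 'b': 2}
print(reordered_input)  # {'a': 1, 'b': 1}
```

In this model, a wrong result with max-rows = 1 can only come from events that arrive at the buffer in the wrong order, which would point at an upstream operator rather than at the buffer.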
Update:
I found that my left join operator is what reorders the records. The
question is: why does the left join change the record order?
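One possible explanation, not confirmed for this job, is repartitioning: in a parallel pipeline a join typically distributes records by the join key, so two updates to the same primary key can be handled by different subtasks when the join key changes between them. The sketch below is a toy model under that assumption; `route`, `drain`, the field names, and the parallelism of 2 are all hypothetical and are not Flink APIs.

```python
from collections import defaultdict


def route(events, key_fn, parallelism):
    """Partition events into per-subtask queues; order is kept within each queue."""
    queues = defaultdict(list)
    for ev in events:
        queues[key_fn(ev) % parallelism].append(ev)
    return queues


def drain(queues, subtask_order):
    """Emit whole queues in the given order, modelling one subtask running ahead."""
    out = []
    for subtask in subtask_order:
        out.extend(queues.get(subtask, []))
    return out


# Two updates to the same primary key whose join key changes from 10 to 11.
events = [
    {"seq": 1, "pk": 1, "join_key": 10},
    {"seq": 2, "pk": 1, "join_key": 11},
]

# Partitioned by join key: the two updates land on different subtasks.
by_join_key = route(events, lambda e: e["join_key"], parallelism=2)
# Partitioned by primary key: both updates stay on the same subtask.
by_pk = route(events, lambda e: e["pk"], parallelism=2)

# Assume subtask 1 happens to emit before subtask 0.
seqs_join = [e["seq"] for e in drain(by_join_key, [1, 0])]
seqs_pk = [e["seq"] for e in drain(by_pk, [1, 0])]

print(seqs_join)  # [2, 1]
print(seqs_pk)    # [1, 2]
```

In this model the per-key order survives only when both updates travel through the same subtask, which is why comparing the join key with the sink's primary key may be a useful first check.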
> flink jdbc sink will lead to unordered result, because the sink buffer
> records execute unorder
> ----------------------------------------------------------------------------------------------
>
> Key: FLINK-27343
> URL: https://issues.apache.org/jira/browse/FLINK-27343
> Project: Flink
> Issue Type: Improvement
> Components: Connectors / JDBC
> Affects Versions: 1.13.6
> Environment: flink 1.13.6
> kafka
> postgres jdbc sink
> Reporter: pengyusong
> Priority: Critical
>
> * situation one
> When I use the Flink SQL Kafka connector to re-consume a topic that
> already holds many messages, the JDBC sink runs with its default
> parameters.
> The Kafka topic is a compacted topic whose contents are the CDC events of
> a MySQL table.
> Some records in one batch share the same key. They are buffered within
> that batch and finally written to Postgres out of order: later records in
> the buffered batch are executed first.
> As a result, older Kafka messages are applied after newer ones, so the
> result is inconsistent with the Kafka message order.
> * situation two
> If I set
> h5. sink.buffer-flush.interval = 0
> h5. sink.buffer-flush.max-rows = 1
> the result is also inconsistent with the Kafka message order.
>
> So I suspect that the execution order within the JDBC buffer is
> non-deterministic, which leads to unordered results in the JDBC sink.
>
> Update:
> I found that my left join operator is what reorders the records. The
> question is: why does the left join change the record order?
--
This message was sent by Atlassian Jira
(v8.20.7#820007)