Timo Walther created FLINK-23426:
------------------------------------
Summary: Support changelog processing in batch mode
Key: FLINK-23426
URL: https://issues.apache.org/jira/browse/FLINK-23426
Project: Flink
Issue Type: Sub-task
Components: Table SQL / API
Reporter: Timo Walther
The DataStream API can execute arbitrary DataStream programs when running in
batch mode. However, this is not the case for the Table API batch mode. E.g. a
source with non-insert only changes is not supported and updates/deletes cannot
be emitted.
In theory, we could make this work by running the "stream mode" of the planner
(CDC transformations) on top of the "batch mode" of DataStream API (specialized
state backend, sorted inputs). It is up for discussion if and how we expose
such functionality.
If we don't allow enabling incremental updates, we can also add a special batch
operator that materializes the incoming changes for a batch pipeline. However,
it would require "complete" CDC logs (i.e. no missing UPDATE_AFTER).
--
This message was sent by Atlassian Jira
(v8.3.4#803005)