Sergei Morozov created FLINK-38334:
--------------------------------------
Summary: MySQL CDC source may get stuck in the INITIAL_ASSIGNING state
Key: FLINK-38334
URL: https://issues.apache.org/jira/browse/FLINK-38334
Project: Flink
Issue Type: Bug
Components: Flink CDC
Affects Versions: cdc-3.2.0
Reporter: Sergei Morozov
The logic of filtering out irrelevant snapshot splits on the enumerator and the
source reader is inconsistent:
# The enumerator filters them out only if the assigner is in the
INITIAL_ASSIGNING_FINISHED or the NEWLY_ADDED_ASSIGNING_FINISHED state
([source|https://github.com/apache/flink-cdc/blob/release-3.2.0/flink-cdc-connect/flink-cdc-source-connectors/flink-connector-mysql-cdc/src/main/java/org/apache/flink/cdc/connectors/mysql/source/assigners/MySqlSnapshotSplitAssigner.java#L219]).
# The source reader filters them out regardless of the assigner state.
As a result, if a table is excluded from the source configuration while that
table is being snapshotted, the source reader drops its splits, but the
enumerator still expects the reader to process them and report them back as
finished. Those reports never arrive, so the source gets stuck in the
snapshotting phase and never transitions to streaming.
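
The mismatch can be illustrated with a minimal, self-contained sketch. This is a simplified model, not the connector's code: the class SplitFilterSketch, its methods, the reduced Status enum and the table names are all hypothetical, and each split is represented only by the name of its table.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Set;

public class SplitFilterSketch {

    /** Only the assigner states named in this report; not the connector's full enum. */
    enum Status { INITIAL_ASSIGNING, INITIAL_ASSIGNING_FINISHED, NEWLY_ADDED_ASSIGNING_FINISHED }

    /** Enumerator side (modelled): drops splits of excluded tables only in a *_FINISHED state. */
    static List<String> enumeratorPending(List<String> splitTables, Set<String> included, Status status) {
        boolean filters = status == Status.INITIAL_ASSIGNING_FINISHED
                || status == Status.NEWLY_ADDED_ASSIGNING_FINISHED;
        List<String> pending = new ArrayList<>();
        for (String table : splitTables) {
            if (!filters || included.contains(table)) {
                pending.add(table);
            }
        }
        return pending;
    }

    /** Reader side (modelled): drops splits of excluded tables regardless of the assigner state. */
    static List<String> readerAccepted(List<String> splitTables, Set<String> included) {
        List<String> accepted = new ArrayList<>();
        for (String table : splitTables) {
            if (included.contains(table)) {
                accepted.add(table);
            }
        }
        return accepted;
    }

    /** Splits the enumerator keeps waiting for although the reader will never report them. */
    static List<String> orphaned(List<String> splitTables, Set<String> included, Status status) {
        List<String> waiting = new ArrayList<>(enumeratorPending(splitTables, included, status));
        waiting.removeAll(readerAccepted(splitTables, included));
        return waiting;
    }

    public static void main(String[] args) {
        List<String> splits = Arrays.asList("db.orders", "db.users");
        // Hypothetical scenario: db.users was removed from the configuration mid-snapshot.
        Set<String> included = Collections.singleton("db.orders");

        // Still assigning: the enumerator does not filter, so db.users is awaited forever.
        System.out.println(orphaned(splits, included, Status.INITIAL_ASSIGNING));
        // Assigning finished: both sides filter the same way, nothing is orphaned.
        System.out.println(orphaned(splits, included, Status.INITIAL_ASSIGNING_FINISHED));
    }
}
```

In this model, any non-empty result from orphaned(...) corresponds to the stuck state described above: the enumerator cannot consider the snapshot finished while it is waiting on splits that the reader has silently discarded.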