Sergei Morozov created FLINK-38334:
--------------------------------------

             Summary: MySQL CDC source may get stuck in the INITIAL_ASSIGNING 
state
                 Key: FLINK-38334
                 URL: https://issues.apache.org/jira/browse/FLINK-38334
             Project: Flink
          Issue Type: Bug
          Components: Flink CDC
    Affects Versions: cdc-3.2.0
            Reporter: Sergei Morozov


The logic of filtering out irrelevant snapshot splits on the enumerator and the 
source reader is inconsistent:
 # The enumerator filters them out only if the assigner is in the 
INITIAL_ASSIGNING_FINISHED or the NEWLY_ADDED_ASSIGNING_FINISHED state 
([source|https://github.com/apache/flink-cdc/blob/release-3.2.0/flink-cdc-connect/flink-cdc-source-connectors/flink-connector-mysql-cdc/src/main/java/org/apache/flink/cdc/connectors/mysql/source/assigners/MySqlSnapshotSplitAssigner.java#L219]).
 # The source reader filters them out regardless of the assigner state.

As a result, if a table is excluded from the source configuration during the 
snapshot of that table, the source reader will drop its splits while the 
enumerator will expect them to be processed by the source reader and reported 
back. As a result, the source gets stuck in the snapshotting phase and never 
transitions to streaming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to