liming30 opened a new pull request, #2221: URL: https://github.com/apache/incubator-paimon/pull/2221
### Purpose

<!-- Linking this pull request to the issue -->
Linked issue: close #2157

<!-- What is the purpose of the change -->
[flink] Support consumer-id in `ContinuousFileStoreSource`.

The enumerator assigns splits to readers in snapshot order, and each reader consumes its splits in snapshot order. The consumption progress of a single reader is therefore:

1. If the reader is in the enumerator's waiting list, its progress equals the snapshot of the next split that `splitAssigner` will assign. If `splitAssigner` has no splits left, its progress equals the snapshot the enumerator will scan next.
2. If the reader is not in the enumerator's waiting list, its progress equals the greater of the snapshot consumed by the consumer and the snapshot most recently assigned to it by `splitAssigner`.

The minimum consumption progress across all readers is the current position of the consumer.

### Tests

<!-- List UT and IT cases to verify this change -->
1. `org.apache.paimon.flink.source.ContinuousFileSplitEnumeratorTest#testEnumeratorWithConsumer`
2. `org.apache.paimon.flink.source.FileStoreSourceReaderTest#testReaderOnSplitFinished`

### API and Format

<!-- Does this change affect API or storage format -->
No

### Documentation

<!-- Does this change introduce a new feature -->
No
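
For reviewers, the progress rule from the Purpose section can be sketched roughly as below. This is only an illustration under stated assumptions, not the code in this PR: `ConsumerProgressSketch`, `ReaderState`, `readerProgress`, `consumerPosition`, and all parameter names are hypothetical, and snapshots are modeled as plain `long` ids.

```java
import java.util.Arrays;
import java.util.List;
import java.util.OptionalLong;

/** Hypothetical sketch of the progress rule; none of these names come from Paimon. */
public class ConsumerProgressSketch {

    /** Per-reader state as the enumerator might track it (assumed shape). */
    public static final class ReaderState {
        final boolean waiting;            // true if the reader is in the enumerator's waiting list
        final long consumedSnapshot;      // snapshot reported as consumed
        final long lastAssignedSnapshot;  // snapshot of the split most recently assigned

        public ReaderState(boolean waiting, long consumedSnapshot, long lastAssignedSnapshot) {
            this.waiting = waiting;
            this.consumedSnapshot = consumedSnapshot;
            this.lastAssignedSnapshot = lastAssignedSnapshot;
        }
    }

    /**
     * Rule 1: a waiting reader is at the next snapshot the assigner would hand out,
     * or at the enumerator's next scan snapshot if no split is pending.
     * Rule 2: a busy reader is at max(consumed, last assigned).
     */
    public static long readerProgress(
            ReaderState reader, OptionalLong nextAssignableSnapshot, long nextScanSnapshot) {
        if (reader.waiting) {
            return nextAssignableSnapshot.orElse(nextScanSnapshot);
        }
        return Math.max(reader.consumedSnapshot, reader.lastAssignedSnapshot);
    }

    /** The consumer position is the minimum progress over all readers (empty if there are none). */
    public static OptionalLong consumerPosition(
            List<ReaderState> readers, OptionalLong nextAssignableSnapshot, long nextScanSnapshot) {
        return readers.stream()
                .mapToLong(r -> readerProgress(r, nextAssignableSnapshot, nextScanSnapshot))
                .min();
    }

    public static void main(String[] args) {
        List<ReaderState> readers = Arrays.asList(
                new ReaderState(true, 3L, 3L),    // waiting: falls back to the next scan snapshot
                new ReaderState(false, 4L, 5L),   // busy: max(4, 5) = 5
                new ReaderState(false, 6L, 5L));  // busy: max(6, 5) = 6
        // No pending splits and next scan at snapshot 7, so the minimum is min(7, 5, 6).
        System.out.println(consumerPosition(readers, OptionalLong.empty(), 7L));
    }
}
```

Taking the minimum means the recorded consumer position never moves past a snapshot that some reader may still be processing.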
