liming30 opened a new pull request, #2221:
URL: https://github.com/apache/incubator-paimon/pull/2221

   
   
   ### Purpose
   
   <!-- Linking this pull request to the issue -->
   Linked issue: close #2157 
   
   <!-- What is the purpose of the change -->
   [flink] support the feature of ContinuousFileStoreSource using consumer-id.
   
   `Enumerator` will assign split to the `reader` in the order of snapshots, 
and the `reader` will also consume in the order of snapshots. Therefore, the 
consumption progress of a single reader is:
   
   1. If the reader is in the `enumerator's` waiting list, then the consumption 
progress of the `reader` is equal to the snapshot assigned by `splitAssigner` 
next time. If there is no split in `splitAssigner`, it is equal to the snapshot 
of the next scan by enumerator.
   2. If the reader is not in the `enumerator's` waiting list, the `reader's` 
consumption progress is equal to the greater of the snapshot consumed by the 
consumer and the snapshot recently assigned to it by `splitAssigner`.
   
   The minimum value of the consumption progress of all readers is the current 
position of the `consumer`.
   
   ### Tests
   
   <!-- List UT and IT cases to verify this change -->
   
   1. 
`org.apache.paimon.flink.source.ContinuousFileSplitEnumeratorTest#testEnumeratorWithConsumer`
   2. 
`org.apache.paimon.flink.source.FileStoreSourceReaderTest#testReaderOnSplitFinished`
   
   ### API and Format
   
   <!-- Does this change affect API or storage format -->
   No
   ### Documentation
   
   <!-- Does this change introduce a new feature -->
   No


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to