[I] [Umbrella] Support full scan in batch mode for PrimaryKey Table [fluss]

via GitHub Mon, 27 Oct 2025 23:39:18 -0700


platinumhamburg opened a new issue, #1876:
URL: https://github.com/apache/fluss/issues/1876


   ### Search before asking
   
   - [x] I searched in the [issues](https://github.com/apache/fluss/issues) and 
found nothing similar.
   
   
   ### Motivation
   
   Fluss currently cannot run a full snapshot scan of the latest data on 
Primary Key tables in batch execution mode. This capability matters in many 
scenarios, such as OLAP queries and ad-hoc data inspection. 
   Neither existing scanner meets the requirement: LimitBatchScanner does not 
support full snapshot inspection, and KvSnapshotBatchScanner does not support 
ad-hoc queries on the current dataset snapshot.
   
   ### Solution
   
   The above features can be separated into two subtasks:
   - [ ] Fluss supports ad-hoc full snapshot scanning for Primary Key Tables
   - [ ] Flink Integration: support non-limited scanning for Primary Key Tables 
in batch execution mode
   
   ### Anything else?
   
   The newly introduced BatchScanner should fetch data in a streaming fashion 
underneath, so that a large dataset is never scanned in a single RPC call.
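
A minimal sketch of the streaming-fetch idea, for illustration only. It is not Fluss code: `FakeServer`, `fetch_page`, `full_scan`, and the key-based cursor are hypothetical names, and the in-memory list stands in for a table snapshot. The point is that each request is bounded by a page size and the client resumes from the last key it saw.

```python
from typing import Dict, Iterator, List, Optional, Tuple

Row = Tuple[int, str]  # (primary key, value); hypothetical row shape


class FakeServer:
    """Hypothetical stand-in for a server holding a primary-key table snapshot."""

    def __init__(self, table: Dict[int, str]) -> None:
        # Freeze a key-ordered snapshot so every page reads the same data.
        self._snapshot: List[Row] = sorted(table.items())
        self.rpc_count = 0

    def fetch_page(self, after_key: Optional[int], max_rows: int) -> List[Row]:
        """One bounded request: at most max_rows rows with key > after_key."""
        self.rpc_count += 1
        rows = [r for r in self._snapshot if after_key is None or r[0] > after_key]
        return rows[:max_rows]


def full_scan(server: FakeServer, page_size: int) -> Iterator[Row]:
    """Yield every row of the snapshot, one bounded page per request."""
    after_key: Optional[int] = None
    while True:
        page = server.fetch_page(after_key, page_size)
        if not page:
            return  # an empty page means the snapshot is exhausted
        yield from page
        after_key = page[-1][0]  # resume after the last key seen


server = FakeServer({3: "c", 1: "a", 2: "b", 5: "e", 4: "d"})
rows = list(full_scan(server, page_size=2))
```

With this shape, no single request returns more than `page_size` rows, at the cost of one extra empty request to detect the end of the scan.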
   
   ### Willingness to contribute
   
   - [ ] I'm willing to submit a PR!


