tsreaper opened a new pull request, #2319:
URL: https://github.com/apache/incubator-paimon/pull/2319

   ### Purpose
   
   Currently many C++ big data systems (for example StarRocks) have supported 
Paimon. However as Paimon's data files use the LSM tree structure, these data 
files must be merged before reading, and these C++ systems have to use Paimon's 
Java reader. This affects the performance a lot.
   
   However, under certain circumstances (for example, when a table is fully 
compacted) the data files can be read without merging. This PR introduces a new 
public API called `RawTableFile` to indicate such files. API users can try to 
extract `RawTableFile` from `DataSplit` and directly read the files with their 
own very fast file reader.
   
   ### Tests
   
   * `DataSplitTest`.
   
   ### API and Format
   
   Yes. It introduces a new public API.
   
   ### Documentation
   
   No. This change should only be used by other big data system developers, not 
users.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to