Zouxxyy opened a new pull request, #7123:
URL: https://github.com/apache/paimon/pull/7123

   <!-- Please specify the module before the PR name: [core] ... or [flink] ... 
-->
   
   ### Purpose
   
   There are two main reasons for this change:
   
   1. `blob-as-descriptor` is highly ambiguous during writes—it actually means 
that the input for writing is a descriptor, not that the blob itself is being 
written as a descriptor.
   
   2. A single configuration cannot adequately serve both use cases. I believe, 
the most common scenario should be:
   
     - `write-blob-from-descriptor=true` so that data is loaded at write time 
which is extremely memory-efficient. This maybe can be set to default true, in 
fact, there has been discussion on this topic: 
https://github.com/apache/paimon/pull/7021
     - `read-blob-as-descriptor=false` since in most read scenarios, we need 
the original raw data rather than a descriptor.
   
   ### Tests
   
   <!-- List UT and IT cases to verify this change -->
   
   ### API and Format
   
   <!-- Does this change affect API or storage format -->
   
   ### Documentation
   
   <!-- Does this change introduce a new feature -->
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to