the-other-tim-brown opened a new issue, #18112:
URL: https://github.com/apache/hudi/issues/18112

   ### Task Description
   
   **What needs to be done:**
   Update the writer path so that it can write incoming inline blob data to a 
container file that holds one or more of these byte arrays. Whether a blob is 
stored inline or out-of-line is determined by the configurations described in 
the RFC. Any file created by the Hudi writer is automatically considered a 
managed file.
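
   A minimal sketch of the inline vs out-of-line decision, for discussion only. 
Every name here (`BlobRef`, `BlobContainerWriter`, `inlineMaxBytes`) is 
hypothetical and does not come from Hudi or the RFC. A single size threshold 
stands in for the RFC's configurations, and a plain `OutputStream` stands in 
for the container file.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.io.UncheckedIOException;

/** Hypothetical result of a write: the bytes kept inline, or a pointer into the container. */
final class BlobRef {
    final byte[] inlineBytes; // non-null only when the blob stays inline in the record
    final long offset;        // start position in the container, -1 when inline
    final int length;         // blob size in bytes

    private BlobRef(byte[] inlineBytes, long offset, int length) {
        this.inlineBytes = inlineBytes;
        this.offset = offset;
        this.length = length;
    }

    static BlobRef inline(byte[] bytes) {
        return new BlobRef(bytes, -1L, bytes.length);
    }

    static BlobRef outOfLine(long offset, int length) {
        return new BlobRef(null, offset, length);
    }

    boolean isInline() {
        return inlineBytes != null;
    }
}

/** Hypothetical writer that appends large blobs to one container stream. */
final class BlobContainerWriter {
    private final OutputStream container; // stand-in for the managed container file
    private final int inlineMaxBytes;     // assumed threshold, not an actual Hudi config
    private long position = 0L;           // bytes written to the container so far

    BlobContainerWriter(OutputStream container, int inlineMaxBytes) {
        this.container = container;
        this.inlineMaxBytes = inlineMaxBytes;
    }

    /** Keeps small blobs inline; appends larger ones and returns their offset and length. */
    BlobRef write(byte[] blob) {
        if (blob.length <= inlineMaxBytes) {
            return BlobRef.inline(blob);
        }
        long start = position;
        try {
            container.write(blob);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        position += blob.length;
        return BlobRef.outOfLine(start, blob.length);
    }
}
```

   In this sketch the record would carry either the inline bytes or the 
(offset, length) pair, so several byte arrays can share one container file.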
   
   **Why this task is needed:**
   Lets users offload large byte arrays to separate files, reducing the 
storage size of the structured columns in their dataset.
   
   ### Task Type
   
   Code improvement/refactoring
   
   ### Related Issues
   
   **Parent feature issue:** (if applicable)
   **Related issues:**
   

