vinothchandar opened a new issue, #13922:
URL: https://github.com/apache/hudi/issues/13922

   ### Feature Description
   
   **What the feature achieves:**
   
   See  RFC-80 
   
   https://github.com/apache/hudi/blob/master/rfc/rfc-80/rfc-80.md 
   
   
   **Why this feature is needed:**
   
   Many streaming and ML use-cases employ wide tables to store 1000s of columns 
in a single table. Hudi storage must evolve to support these needs -- while 
keeping write amplification and table service efficiency 
   
   ### User Experience
   
   **How users will use this feature:**
   
   Writers should be able to get automatic, intelligent splitting of columns 
across column groups. Table services must be able to change column groups 
within a file group. For advanced writers for e.g streaming jobs that 
specifically want to separate the columns they write to, users need to be able 
to control the mappings of columns to column groups for a given write operation.
   
   - Configuration changes needed: WIP
   - API changes: WIP
   - Usage examples: WIP
   
   
   ### Hudi RFC Requirements
   
   **RFC PR link:** (if applicable)
   https://github.com/apache/hudi/blob/master/rfc/rfc-80/rfc-80.md
   
   **Why RFC is/isn't needed:**
   - Does this change public interfaces/APIs? Yes
   - Does this change storage format? Yes
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to