vinothchandar opened a new issue, #13922: URL: https://github.com/apache/hudi/issues/13922
### Feature Description **What the feature achieves:** See RFC-80 https://github.com/apache/hudi/blob/master/rfc/rfc-80/rfc-80.md **Why this feature is needed:** Many streaming and ML use-cases employ wide tables to store 1000s of columns in a single table. Hudi storage must evolve to support these needs -- while keeping write amplification and table service efficiency ### User Experience **How users will use this feature:** Writers should be able to get automatic, intelligent splitting of columns across column groups. Table services must be able to change column groups within a file group. For advanced writers for e.g streaming jobs that specifically want to separate the columns they write to, users need to be able to control the mappings of columns to column groups for a given write operation. - Configuration changes needed: WIP - API changes: WIP - Usage examples: WIP ### Hudi RFC Requirements **RFC PR link:** (if applicable) https://github.com/apache/hudi/blob/master/rfc/rfc-80/rfc-80.md **Why RFC is/isn't needed:** - Does this change public interfaces/APIs? Yes - Does this change storage format? Yes -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
