bhat-vinay opened a new pull request, #10625: URL: https://github.com/apache/hudi/pull/10625
… defined through options Initial commit. Supports the following features: 1. Modify schema to ass secondary index to metadata 2. New partition type in the metadata table to store secondary_keys-to-record_keys mapping 3. Various options to support secondary index enablement, column mappings (for secondary keys) etc 4. Initialization of secondary keys 5. Update secondary keys on inserts/upserts Supports only one secondary index at the moment. The PR is still a WIP and needs more work to handle deletions, proper merging, compaction, (re) clustering among other things. ### Change Logs Initial commit. Supports the following features: 1. Modify schema to ass secondary index to metadata 2. New partition type in the metadata table to store secondary_keys-to-record_keys mapping 3. Various options to support secondary index enablement, column mappings (for secondary keys) etc 4. Initialization of secondary keys 5. Update secondary keys on inserts/upserts Supports only one secondary index at the moment. The PR is still a WIP and needs more work to handle deletions, proper merging, compaction, (re) clustering among other things. ### Impact Support secondary index on columns (similar to record index, but for non-unique columns) ### Risk level (write none, low medium or high below) Medium. New and existing tests ### Documentation Update NA. Will be done later ### Contributor's checklist - [ ] Read through [contributor's guide](https://hudi.apache.org/contribute/how-to-contribute) - [ ] Change Logs and Impact were stated clearly - [ ] Adequate tests were added if applicable - [ ] CI passed -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
