vinishjail97 commented on PR #13925: URL: https://github.com/apache/hudi/pull/13925#issuecomment-3349567658
@yihua Need help on the following to improve this PR. 1. There's a need for a better interface which extends `org.apache.hudi.table.action.commit.Partitioner` having methods like getBucketInfo etc. used by both spark/java/flink engines. I have added a TODO interface to get basic tests passing, can you let me know if there's a better way to reduce duplicate code between spark/java and other engines? 2. HoodieBucketIndex supports tagging for INSERT and BULK_INSERT operations but enforces the presence of a recordKey in `validateBucketIndexConfig`. Can we remove this to unblock HoodieBucketIndex use-cases for append-only tables? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
