stayrascal edited a comment on issue #4030: URL: https://github.com/apache/hudi/issues/4030#issuecomment-1022865829
> Thanks for the reply. How about changing the ValueState of `BucketAssignFunction` to store the whole `HoodieRecord` instead of `HoodieRecordGlobalLocation`? Then, when the partition path changes, besides emitting a delete record for the old file (partition), we could also update the partition path of the old record and emit that updated record to the new file (partition). Later, the incoming record and the updated record (derived from the old record) would be merged via `#preCombine` in `StreamWriteFunction`. I'm not sure whether storing the whole `HoodieRecord` in ValueState instead of `HoodieRecordGlobalLocation` has any impact other than a larger state size?
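To make the proposed flow concrete, here is a rough, self-contained sketch in plain Java. It does not use the real Hudi or Flink APIs: `PartitionChangeSketch`, `Rec`, `process`, and the `HashMap` standing in for the keyed ValueState are all hypothetical names made up for illustration, and the `preCombine` here is a simple "larger ordering value wins" stand-in rather than Hudi's payload logic. What the state should hold after a partition change is also an assumption (the merge winner).

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-ins only: none of these types are Hudi or Flink classes.
final class PartitionChangeSketch {

    /** Simplified record: key, partition path, ordering value, payload, delete flag. */
    static final class Rec {
        final String key;
        final String partition;
        final long orderingVal;
        final String payload;
        final boolean delete;

        Rec(String key, String partition, long orderingVal, String payload, boolean delete) {
            this.key = key;
            this.partition = partition;
            this.orderingVal = orderingVal;
            this.payload = payload;
            this.delete = delete;
        }

        Rec withPartition(String newPartition) {
            return new Rec(key, newPartition, orderingVal, payload, delete);
        }

        Rec asDelete() {
            return new Rec(key, partition, orderingVal, payload, true);
        }
    }

    // Plays the role of the keyed ValueState: record key -> last full record.
    private final Map<String, Rec> state = new HashMap<>();

    /** Returns the records to emit downstream for one incoming record. */
    List<Rec> process(Rec incoming) {
        List<Rec> out = new ArrayList<>();
        Rec old = state.get(incoming.key);
        Rec winner = incoming;
        if (old != null) {
            Rec moved = old.withPartition(incoming.partition);
            if (!old.partition.equals(incoming.partition)) {
                out.add(old.asDelete()); // delete record for the old partition
                out.add(moved);          // old record rewritten into the new partition
            }
            // Assumption: the state keeps whichever record would win the merge.
            winner = preCombine(moved, incoming);
        }
        out.add(incoming);
        state.put(incoming.key, winner);
        return out;
    }

    /** Stand-in for preCombine: larger ordering value wins, ties go to the later record. */
    static Rec preCombine(Rec earlier, Rec later) {
        return later.orderingVal >= earlier.orderingVal ? later : earlier;
    }
}
```

In this sketch, if key `k1` is first written to partition `p1` with ordering value 5 and a later-arriving record for `k1` targets `p2` with ordering value 3, `process` emits a delete for `p1`, the old record moved to `p2`, and the incoming record. Merging the last two keeps the old payload in the new partition, which is only possible because the state holds the full record rather than just its location.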
