stayrascal edited a comment on issue #4030:
URL: https://github.com/apache/hudi/issues/4030#issuecomment-1022865829


   Thanks for the reply.
   
   I'm thinking about changing the ValueState of `BucketAssignFunction` to 
store the whole `HoodieRecord` instead of `HoodieRecordGlobalLocation`. Then, 
when the partition path changes, besides emitting a delete record for the old 
file (partition), we could also update the partition path of the old record and 
emit the updated record to the new file (partition).
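
   A rough sketch of the idea, to make the flow concrete. This is not Hudi's or 
Flink's actual API: `SketchRecord` and `PartitionChangeSketch` are hypothetical 
stand-ins, a plain `HashMap` plays the role of the keyed ValueState, and storing 
the incoming record as the new state value is an assumption.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a full record (NOT Hudi's HoodieRecord).
final class SketchRecord {
    final String key;
    final String partitionPath;
    final String payload;
    final boolean isDelete;

    SketchRecord(String key, String partitionPath, String payload, boolean isDelete) {
        this.key = key;
        this.partitionPath = partitionPath;
        this.payload = payload;
        this.isDelete = isDelete;
    }

    @Override
    public String toString() {
        return (isDelete ? "DELETE " : "UPSERT ") + key + "@" + partitionPath + " -> " + payload;
    }
}

// Hypothetical stand-in for the bucket-assign step (NOT BucketAssignFunction).
final class PartitionChangeSketch {
    // Simulates keyed state: record key -> last full record seen for that key.
    private final Map<String, SketchRecord> state = new HashMap<>();

    List<SketchRecord> process(SketchRecord incoming) {
        List<SketchRecord> out = new ArrayList<>();
        SketchRecord old = state.get(incoming.key);
        if (old != null && !old.partitionPath.equals(incoming.partitionPath)) {
            // 1. Delete the old record from its old partition.
            out.add(new SketchRecord(old.key, old.partitionPath, old.payload, true));
            // 2. Re-emit the old record with the new partition path, so a later
            //    merge step can combine it with the incoming record.
            out.add(new SketchRecord(old.key, incoming.partitionPath, old.payload, false));
        }
        // 3. Emit the incoming record itself.
        out.add(incoming);
        // Assumption: the state keeps the latest incoming record.
        state.put(incoming.key, incoming);
        return out;
    }

    public static void main(String[] args) {
        PartitionChangeSketch sketch = new PartitionChangeSketch();
        sketch.process(new SketchRecord("id1", "par1", "v1", false));
        // Same key, different partition: a delete, a re-targeted old record,
        // and the incoming record are emitted.
        for (SketchRecord r : sketch.process(new SketchRecord("id1", "par2", "v2", false))) {
            System.out.println(r);
        }
    }
}
```

   The trade-off is visible in the map's value type: it holds the whole record 
rather than just a location.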
   
   Later, the incoming record and the updated record (derived from the old 
record) would be merged via `#preCombine` in `StreamWriteFunction`.
   
   Is there any other impact of using the ValueState to store the whole 
`HoodieRecord` instead of `HoodieRecordGlobalLocation`, apart from the larger 
state size?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

