nsivabalan commented on issue #3394:
URL: https://github.com/apache/hudi/issues/3394#issuecomment-895408837


   There are some  nuances here. Ignore the global, different partitions for 
now. Just consider how to reconcile two records in general. 
   
   I guess you know what preCombine is used for (which is used to combine two 
records within same incoming batch of write). 
   But to reconcile an incoming record with one already on storage, Hudi relies 
on HoodieRecordPayload.combineAndGetUpdateValue(). 
   
   Most commonly used payload impl is OverwriteWithLatestAvroPayload. So, this 
will always choose the latest incoming record over whats in storage. 
   
   But recently we also added another payload impl called 
DefaultHoodieRecordPayload. This  payload will honor preCombine field while 
reconciling an incoming record with whats in storage using the preCombine field 
value(within combineAndGetUpdateValue()). 
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to