danny0405 commented on PR #19005:
URL: https://github.com/apache/hudi/pull/19005#issuecomment-4764071146

   > Partial updates: only the changed columns come back (the other columns are
   > null/garbled), because a partial-update log block holds only the changed
   > columns and the base-file row is dropped before the runtime merge.
   >
   > EVENT_TIME_ORDERING (even without partial updates): a window write with a
   > lower ordering value can surface even though the existing higher-ordering
   > version should win.
   
   Well, I'm not sure we should do this. From my understanding, incremental 
queries should generally return only the new changes rather than merge them 
with the full history of the dataset. Any such merge has to be bounded by a 
window, and because the dataset is evolving all the time, there is no clear 
definition of what span that window should cover.
   
   If per-commit merging is required, let's suggest that users use a COW 
table, where incremental queries should return the correct data.
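
   To make the two behaviours concrete, here is a toy sketch in plain Python. It is not Hudi code: the record layout, the `ts` ordering field, and every function name are hypothetical, and it only models event-time ordering for a single key. It contrasts a read that returns just the window's changes with one that also merges against older history.

```python
from typing import Any, Dict, List, Optional

Record = Dict[str, Any]  # hypothetical layout: {"key", "ts", "val"}


def pick_by_ordering(old: Optional[Record], new: Record) -> Record:
    """Event-time ordering: keep the record with the higher `ts` (ties go to `new`)."""
    if old is None or new["ts"] >= old["ts"]:
        return new
    return old


def read_window_only(window_writes: List[Record]) -> Dict[str, Record]:
    """Return only what was written inside the window, merged among themselves."""
    result: Dict[str, Record] = {}
    for rec in window_writes:
        result[rec["key"]] = pick_by_ordering(result.get(rec["key"]), rec)
    return result


def read_merged_with_history(
    history: List[Record], window_writes: List[Record]
) -> Dict[str, Record]:
    """Return keys touched in the window, but merged against older history too."""
    touched = {rec["key"] for rec in window_writes}
    result: Dict[str, Record] = {}
    for rec in history + window_writes:
        if rec["key"] in touched:
            result[rec["key"]] = pick_by_ordering(result.get(rec["key"]), rec)
    return result


# An existing version with a higher ordering value, then a late write in the window.
history = [{"key": "a", "ts": 10, "val": "existing"}]
window = [{"key": "a", "ts": 5, "val": "late-write"}]

window_only = read_window_only(window)
merged = read_merged_with_history(history, window)

print(window_only["a"]["val"])  # late-write
print(merged["a"]["val"])       # existing
```

   In this toy model the second result depends entirely on how far back `history` reaches, which is the "what span should the window cover" question raised above.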


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

