danny0405 commented on PR #19005: URL: https://github.com/apache/hudi/pull/19005#issuecomment-4764071146
> Partial updates: only the changed columns come back (the other columns are null/garbled), because a partial-update log block holds only the changed columns and the base-file row is dropped before the runtime merge.
> EVENT_TIME_ORDERING (even without partial updates): a window write with a lower ordering value can surface even though the existing higher-ordering version should win.

Well, I'm not sure whether we should do this. From my understanding, incremental queries should generally return the new changes rather than merge them with the entire historical dataset, because merging always happens over a window, the dataset is evolving all the time, and there is no clear definition of what span that window should cover. If per-commit merging is required, let's suggest that users use a COW table, where incremental queries should return the right datasets.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at: [email protected]
