danny0405 commented on issue #7836: URL: https://github.com/apache/hudi/issues/7836#issuecomment-1421946941
Hudi does not track record versioning at the per-record level. Each record has a metadata field that bookkeeps its commit time, but by default the `SNAPSHOT` query view always exposes the latest version of the data files. Hudi has a concept named `Timeline` that keeps track of all the active historical versions (versions earlier than the active instants are cleaned). If you want to do a time travel query, just specify the query end_time. For your example, you can query the historical version of the record by specifying the end_time as `20230202164941321`, but make sure that this instant is still active on the timeline and that the cleaning strategy has not yet taken effect on this instant timestamp.

Some documents:
Timeline: https://hudi.apache.org/docs/timeline
TimeTravel: https://hudi.apache.org/docs/quick-start-guide/#point-in-time-query
Cleaning: https://hudi.apache.org/docs/hoodie_cleaner
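
As a rough illustration of specifying the end_time, here is a minimal sketch of the read options such a query might use from PySpark. The option keys follow the point-in-time query in the Hudi quick start guide, plus the `as.of.instant` time travel option; the helper names `point_in_time_options` and `time_travel_options` and the table path are hypothetical and not from the issue, and the exact behavior may differ between Hudi versions.

```python
def point_in_time_options(end_time: str, begin_time: str = "000") -> dict:
    """Build Hudi read options for a query bounded by end_time.

    A begin_time of "000" means "from the earliest commit available".
    """
    return {
        "hoodie.datasource.query.type": "incremental",
        "hoodie.datasource.read.begin.instanttime": begin_time,
        "hoodie.datasource.read.end.instanttime": end_time,
    }


def time_travel_options(as_of_instant: str) -> dict:
    """Build Hudi read options for a snapshot as of the given instant."""
    return {"as.of.instant": as_of_instant}


# The instant from the example in this thread.
options = point_in_time_options("20230202164941321")

# With a live SparkSession `spark` (assumed, not created here) and a
# placeholder table path, the read would look roughly like:
# df = spark.read.format("hudi").options(**options).load("/path/to/hudi_table")
```

Either query can only succeed while the instant is still on the active timeline, so the cleaner retention settings need to cover it.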
