leaves12138 opened a new issue, #6022: URL: https://github.com/apache/paimon/issues/6022
### Search before asking - [x] I searched in the [issues](https://github.com/apache/paimon/issues) and found nothing similar. ### Motivation <h1>Motivation</h1><p>In batch mode, "MERGE INTO" or "UPDATE" operations frequently occur. Rows may variety from snapshots. We have no idea how to track the row variation.</p><p>It's useful to find the whole row lineage for user if they must track the source for one abnormal record. For example, if we meet one abnormal record, we may wanna</p><p>how it produce and how it changes to today we see. To better support these, we would like to add a new table function to tag the datas' lineage.</p><p><br /></p><p>The concept is already used in several other data storage systems like:</p> System | Link -- | -- Iceberg | https://iceberg.apache.org/spec/?h=row+lineage#row-lineage DeltaLake | https://docs.delta.io/latest/delta-row-tracking.html SnowFlake | https://docs.snowflake.com/en/user-guide/ui-snowsight-lineage ### Solution _No response_ ### Anything else? _No response_ ### Are you willing to submit a PR? - [ ] I'm willing to submit a PR! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@paimon.apache.org.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org