kaijiezhang0319 opened a new issue #4058: URL: https://github.com/apache/iceberg/issues/4058
Hello Iceberg folks, I have a blew scenario and want to get some ideas on how to solve it. I have 2 iceberg table, 1st named "source_table" and 2nd named "target_table". I am doing some data transform job among these 2 iceberg table (load data from "source_table", then apply a SQL, final write to "target_table") and this job is run periodically(eg: 1 hr). is there any way that i can maintain the snapshot id relationship between these 2 table ? (in above data transform job, each time, "target_table" would have a new snapshot id generated, 'target_snapid', the snapshot id used for load "source_table" is "source_snapid", then "source_snapid" and "target_snapid" is mapped). one thing can guarantee is : for "target_table", only above data transform job 1 writer, similar , "source_table" also only have 1 writer. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
