kaijiezhang0319 opened a new issue #4058:
URL: https://github.com/apache/iceberg/issues/4058


   Hello Iceberg folks,  I have a blew scenario and want to get some ideas on 
how to solve it. 
   
   I have 2 iceberg table, 1st named "source_table" and 2nd named 
"target_table". I am doing some data transform job among these 2 iceberg table 
(load data from "source_table", then apply a SQL, final write to 
"target_table") and this job is run periodically(eg: 1 hr).   is there any way 
that i can maintain the snapshot id relationship between these 2 table ?  (in 
above data transform job, each time, "target_table" would have a new snapshot 
id generated, 'target_snapid',   the snapshot id used for load "source_table" 
is "source_snapid",  then "source_snapid" and "target_snapid" is mapped). 
   
   one thing can guarantee is : for "target_table", only above data transform 
job 1 writer,  similar , "source_table" also only have 1 writer. 
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to