[GitHub] [iceberg] kaijiezhang0319 opened a new issue #4058: Relationship between 2 snapshot id during data transform.

GitBox Mon, 07 Feb 2022 18:00:48 -0800


kaijiezhang0319 opened a new issue #4058:
URL: https://github.com/apache/iceberg/issues/4058



   Hello Iceberg folks, I have the scenario below and would like some ideas on 
how to solve it. 
   
   I have 2 Iceberg tables: the 1st is named "source_table" and the 2nd is 
named "target_table". A data transform job runs between these 2 tables (load 
data from "source_table", apply a SQL query, then write the result to 
"target_table"), and it runs periodically (e.g. every 1 hr). Is there any way 
to maintain the snapshot id relationship between these 2 tables? Each run of 
the job generates a new snapshot id on "target_table", 'target_snapid', and 
loads "source_table" at snapshot id "source_snapid"; I want "source_snapid" 
and "target_snapid" to be mapped to each other. 
   
   One thing I can guarantee: "target_table" has only 1 writer, the data 
transform job above. Similarly, "source_table" also has only 1 writer. 
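
To make the desired relationship concrete, here is a minimal sketch of the mapping in plain Python. It does not call any Iceberg API: `SnapshotLineage`, `run_transform`, and the `write_target` callback are hypothetical names used only for illustration, and the snapshot ids are made up. It assumes the single-writer guarantee above, so each "target_snapid" maps to exactly one "source_snapid", while one "source_snapid" may map to several target snapshots if "source_table" did not change between runs.

```python
from typing import Callable, Dict, List


class SnapshotLineage:
    """Hypothetical in-memory record of target_snapid -> source_snapid."""

    def __init__(self) -> None:
        self._source_by_target: Dict[int, int] = {}

    def record(self, source_snapid: int, target_snapid: int) -> None:
        # With a single writer on target_table, every run produces a fresh
        # target snapshot id, so a repeated id indicates a bug.
        if target_snapid in self._source_by_target:
            raise ValueError(f"target snapshot {target_snapid} is already mapped")
        self._source_by_target[target_snapid] = source_snapid

    def source_for(self, target_snapid: int) -> int:
        """Return the source snapshot that a target snapshot was built from."""
        return self._source_by_target[target_snapid]

    def targets_for(self, source_snapid: int) -> List[int]:
        """Return every target snapshot built from a source snapshot."""
        return [t for t, s in self._source_by_target.items() if s == source_snapid]


def run_transform(
    source_snapid: int,
    write_target: Callable[[int], int],
    lineage: SnapshotLineage,
) -> int:
    """One periodic run: read source at source_snapid, write target, record the pair.

    write_target is a hypothetical callback standing in for "load, apply SQL,
    write"; it must return the snapshot id that the write created.
    """
    target_snapid = write_target(source_snapid)
    lineage.record(source_snapid, target_snapid)
    return target_snapid
```

The important part is that the job pins "source_table" to one snapshot id for the whole run and records the pair only after the write to "target_table" has committed. Where the pair is stored is an open choice: a separate bookkeeping table would work, and since an Iceberg snapshot summary is a string-to-string map, storing "source_snapid" there may also be possible if the engine allows custom summary entries (an assumption to verify).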
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]



