Tanay Tummalapalli created BEAM-7822:
----------------------------------------

             Summary: TriggerCopyJobs in BQ file loads is not atmomic in case 
of failure
                 Key: BEAM-7822
                 URL: https://issues.apache.org/jira/browse/BEAM-7822
             Project: Beam
          Issue Type: Test
          Components: io-python-gcp
    Affects Versions: 2.13.0
            Reporter: Tanay Tummalapalli


Scenario:
If temp_tables are being used, during copying data from the temp table to the 
destination table, if there is a failure in BigQuery mid-way through execution, 
it will raise an Exception causing the pipeline to fail. As a result, some 
temp_tables will be copied and some will not be. When the pipeline is rerun, it 
will cause the same data to be written to new temp_tables and copy jobs will be 
triggered to copy this data to the destination table. 

This will result in duplicate data being written to the BigQuery destination 
table.





--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Reply via email to