[
https://issues.apache.org/jira/browse/BEAM-12482?focusedWorklogId=756634&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-756634
]
ASF GitHub Bot logged work on BEAM-12482:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 13/Apr/22 19:33
Start Date: 13/Apr/22 19:33
Worklog Time Spent: 10m
Work Description: MarcoRob opened a new pull request, #17365:
URL: https://github.com/apache/beam/pull/17365
As [PR-14113](https://github.com/apache/beam/pull/14113), adding a zero row
load job to bigquery to update the schema when using temporary tables before
temp tables copy data into the final destination table, so temp tables schema
matches the destination table.
------------------------
Thank you for your contribution! Follow this checklist to help us
incorporate your contribution quickly and easily:
- [ ] [**Choose
reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and
mention them in a comment (`R: @username`).
- [ ] Format the pull request title like `[BEAM-XXX] Fixes bug in
ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA
issue, if applicable. This will automatically link the pull request to the
issue.
- [ ] Update `CHANGES.md` with noteworthy changes.
- [ ] If this contribution is large, please file an Apache [Individual
Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
See the [Contributor Guide](https://beam.apache.org/contribute) for more
tips on [how to make review process
smoother](https://beam.apache.org/contribute/#make-reviewers-job-easier).
To check the build health, please visit
[https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md](https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md)
GitHub Actions Tests Status (on master branch)
------------------------------------------------------------------------------------------------
[](https://github.com/apache/beam/actions?query=workflow%3A%22Build+python+source+distribution+and+wheels%22+branch%3Amaster+event%3Aschedule)
[](https://github.com/apache/beam/actions?query=workflow%3A%22Python+Tests%22+branch%3Amaster+event%3Aschedule)
[](https://github.com/apache/beam/actions?query=workflow%3A%22Java+Tests%22+branch%3Amaster+event%3Aschedule)
See [CI.md](https://github.com/apache/beam/blob/master/CI.md) for more
information about GitHub Actions CI.
Issue Time Tracking
-------------------
Worklog Id: (was: 756634)
Time Spent: 4.5h (was: 4h 20m)
> BigQueryIO failed to load data to temp table when withSchemaUpdateOptions is
> set.
> ---------------------------------------------------------------------------------
>
> Key: BEAM-12482
> URL: https://issues.apache.org/jira/browse/BEAM-12482
> Project: Beam
> Issue Type: Bug
> Components: io-java-gcp
> Reporter: Siyuan Chen
> Assignee: Marco Robles
> Priority: P2
> Time Spent: 4.5h
> Remaining Estimate: 0h
>
> `withSchemaUpdateOptions` is only supported with WRITE_APPEND or
> WRITE_TRUNCATE with paritioned table. This is noted in the documentation:
> [https://github.com/apache/beam/blob/master/sdks/java/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/bigquery/BigQueryIO.java#L1965]
> When the temp files are spilt to multiple load jobs, the write disposition is
> set to WRITE_TRUNCATE for the temporary tables in favor of retries.
> [https://github.com/apache/beam/blob/a8a8b3ab1292ff408940ba4430c95bc754341589/sdks/java/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/bigquery/WriteTables.java#L219]
> And the loading would fail with error
> "Schema update options should only be specified with WRITE_APPEND
> disposition, or with WRITE_TRUNCATE disposition on a table partition."
>
--
This message was sent by Atlassian Jira
(v8.20.1#820001)