borjavb opened a new issue, #31827:
URL: https://github.com/apache/beam/issues/31827

   ### What would you like to happen?
   
   The default behaviour of  
[BigQueryIO.Write.Method](https://beam.apache.org/releases/javadoc/current/org/apache/beam/sdk/io/gcp/bigquery/BigQueryIO.Write.Method.html)
 for unbounded collections is to use `STREAMING_INSERTS`, which is now 
[categorised as legacy](https://cloud.google.com/bigquery/docs/streaming-data-into-bigquery).
   
   Two new methods, 
[STORAGE_API_AT_LEAST_ONCE](https://beam.apache.org/releases/javadoc/current/org/apache/beam/sdk/io/gcp/bigquery/BigQueryIO.Write.Method.html#STORAGE_API_AT_LEAST_ONCE)
 and 
[STORAGE_WRITE_API](https://beam.apache.org/releases/javadoc/current/org/apache/beam/sdk/io/gcp/bigquery/BigQueryIO.Write.Method.html#STORAGE_WRITE_API),
 are available. Of the two, `STORAGE_API_AT_LEAST_ONCE` is the closest in its underlying 
semantics to `STREAMING_INSERTS` ([best effort deduplication but no 
guarantees](https://cloud.google.com/bigquery/docs/streaming-data-into-bigquery#dataconsistency)).
 Using the storage API is also [50% cheaper than the legacy streaming inserts, 
with the first 2TB 
free](https://cloud.google.com/bigquery/pricing#data_ingestion_pricing).
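   To make the pricing claim concrete, here is a rough sketch of the arithmetic. It only encodes the two facts stated above (half the legacy per-GB price, first 2TB free). The class name, method names, the legacy price passed in `main`, and the choice to treat 2TB as 2048 GB are all assumptions for illustration, not values taken from the pricing page.

```java
public class IngestionCost {
    // Assumption: the free "2TB" allowance is treated as 2048 GB per month.
    static final double FREE_TIER_GB = 2048.0;

    // Legacy streaming inserts: every GB is billed at the legacy price.
    static double legacyCost(double gb, double legacyPricePerGb) {
        return gb * legacyPricePerGb;
    }

    // Storage API: the free allowance is subtracted first, then the
    // remainder is billed at half the legacy price (the "50% cheaper" claim).
    static double storageApiCost(double gb, double legacyPricePerGb) {
        double billableGb = Math.max(0.0, gb - FREE_TIER_GB);
        return billableGb * legacyPricePerGb * 0.5;
    }

    public static void main(String[] args) {
        double monthlyGb = 3048.0;        // hypothetical monthly ingestion volume
        double legacyPricePerGb = 0.05;   // hypothetical price, check the pricing page
        System.out.printf("legacy: %.2f, storage API: %.2f%n",
            legacyCost(monthlyGb, legacyPricePerGb),
            storageApiCost(monthlyGb, legacyPricePerGb));
    }
}
```

   Under these assumptions, any pipeline ingesting less than the free allowance per month would pay nothing for ingestion with the storage API.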
   
   Should the default method be `STORAGE_API_AT_LEAST_ONCE` instead of 
`STREAMING_INSERTS`?
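   For context, a pipeline can already opt in explicitly with `withMethod`, so a change of default would only affect writes that do not set a method. A minimal sketch follows, assuming the Beam Java SDK and its Google Cloud Platform IO module are on the classpath. The class name, helper name, and the `tableSpec` and `schema` parameters are hypothetical and only there for illustration.

```java
import com.google.api.services.bigquery.model.TableRow;
import com.google.api.services.bigquery.model.TableSchema;
import org.apache.beam.sdk.io.gcp.bigquery.BigQueryIO;
import org.apache.beam.sdk.io.gcp.bigquery.WriteResult;
import org.apache.beam.sdk.values.PCollection;

public class ExplicitWriteMethod {
    // Hypothetical helper: writes an (unbounded) collection of rows using the
    // at-least-once storage API method instead of relying on the default.
    static WriteResult writeAtLeastOnce(
            PCollection<TableRow> rows, String tableSpec, TableSchema schema) {
        return rows.apply(
            "WriteToBigQuery",
            BigQueryIO.writeTableRows()
                .to(tableSpec) // e.g. "project:dataset.table"
                .withSchema(schema)
                // Explicit choice of method; without this call, unbounded
                // input currently falls back to STREAMING_INSERTS.
                .withMethod(BigQueryIO.Write.Method.STORAGE_API_AT_LEAST_ONCE)
                .withCreateDisposition(BigQueryIO.Write.CreateDisposition.CREATE_IF_NEEDED)
                .withWriteDisposition(BigQueryIO.Write.WriteDisposition.WRITE_APPEND));
    }
}
```

   Pipelines that already call `withMethod` like this would presumably be unaffected by the proposed change.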
   
   ### Issue Priority
   
   Priority: 3 (nice-to-have improvement)
   
   ### Issue Components
   
   - [ ] Component: Python SDK
   - [ ] Component: Java SDK
   - [ ] Component: Go SDK
   - [ ] Component: Typescript SDK
   - [X] Component: IO connector
   - [ ] Component: Beam YAML
   - [ ] Component: Beam examples
   - [ ] Component: Beam playground
   - [ ] Component: Beam katas
   - [ ] Component: Website
   - [ ] Component: Spark Runner
   - [ ] Component: Flink Runner
   - [ ] Component: Samza Runner
   - [ ] Component: Twister2 Runner
   - [ ] Component: Hazelcast Jet Runner
   - [ ] Component: Google Cloud Dataflow Runner


