[ 
https://issues.apache.org/jira/browse/BEAM-11648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17441559#comment-17441559
 ] 

Marcin Kuthan commented on BEAM-11648:
--------------------------------------

Existing Streaming API is marked as deprecated: 
[https://cloud.google.com/bigquery/streaming-data-into-bigquery.] BigQuery 
documentation recommends new Storage Write API but Beam seems to be far, far 
behind. It looks that implementation already exists: 
[https://beam.apache.org/releases/javadoc/2.33.0/org/apache/beam/sdk/io/gcp/bigquery/BigQueryIO.Write.Method.html#STORAGE_WRITE_API]
 but:
 # Documentation is missing (JavaDoc "Use the new, experimental Storage Write 
API." is not enough, really).
 # I don't know which BigQueryIO features are supported in STORAGE_WRITE_API 
(e.g. dynamic destination, retry policies, insert ids, invalid rows handling, 
beam schema, auto sharding, clustering, partitioning, etc.).
 # I don't know which Beam version supports new API (Jira "fix version" is not 
set)

Existing Streaming Insert method has major limitations (low throughput, high 
costs), we really need decent support for Storage Write API.

> Implement new BigQuery sink (Vortex)
> ------------------------------------
>
>                 Key: BEAM-11648
>                 URL: https://issues.apache.org/jira/browse/BEAM-11648
>             Project: Beam
>          Issue Type: New Feature
>          Components: extensions-java-gcp
>            Reporter: Reuven Lax
>            Priority: P3
>          Time Spent: 49h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to