Hey all, I’m trying to use the field clustering beta feature in bigquery [1]. However, the current Beam/dataflow worker bigquery api service dependency is ‘google-api-services-bigquery: com.google.apis: v2-rev374-1.23.0’, which does not include the clustering option in the TimePartitioning class. Hereby, I can’t specify the clustering field when loading/streaming into bigquery. See [2] for the bigquery api error details.
Does anyone know a workaround for this? I guess that in the worst case I’ll have to wait until Beam supports a newer version of the bigquery api service. 1. After checking the Beam Jira I’ve found BEAM-5191<https://jira.apache.org/jira/browse/BEAM-5191>. Is there any way I can help to push this forward and make this feature possible in the near future? Thanks in advance, Wout [1] https://cloud.google.com/bigquery/docs/clustered-tables [2] "errorResult" : { "message" : "Incompatible table partitioning specification. Expects partitioning specification interval(type:day,field:publish_time) clustering(clustering_id), but input partitioning specification is interval(type:day,field:publish_time)", "reason" : "invalid" }
