[
https://issues.apache.org/jira/browse/BEAM-12865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Quentin Sommer updated BEAM-12865:
----------------------------------
Description:
Hi,
We allow customising the {{batch_size}} when streaming to BigQuery, but the
batch duration (used by {{GroupIntoBatches}}) is fixed at
{{DEFAULT_BATCH_BUFFERING_DURATION_LIMIT_SEC}} (0.2 seconds).
I'd like to add an option to specify the batch duration, to allow better
batching in scenarios with low data throughput.
It will reuse the {{triggering_frequency}} param already used for batch
file loads.
was:
Hi,
We allow customising the {{batch_size}} when streaming to BigQuery but the
batch duration (used by {{GroupIntoBatches}}) is set to
{{DEFAULT_BATCH_BUFFERING_DURATION_LIMIT_SEC}} (0.2)
I'd like to add the option to specify the {{batch_duration}} to allow better
batching for scenarios with little data throughput.
> Allow customising batch duration when streaming with WriteToBigQuery
> --------------------------------------------------------------------
>
> Key: BEAM-12865
> URL: https://issues.apache.org/jira/browse/BEAM-12865
> Project: Beam
> Issue Type: New Feature
> Components: io-py-gcp
> Affects Versions: Not applicable
> Reporter: Quentin Sommer
> Priority: P2
> Labels: stale-P2
> Fix For: Not applicable
>
> Time Spent: 8h 50m
> Remaining Estimate: 0h
>
> Hi,
> We allow customising the {{batch_size}} when streaming to BigQuery, but the
> batch duration (used by {{GroupIntoBatches}}) is fixed at
> {{DEFAULT_BATCH_BUFFERING_DURATION_LIMIT_SEC}} (0.2 seconds).
> I'd like to add an option to specify the batch duration, to allow better
> batching in scenarios with low data throughput.
> It will reuse the {{triggering_frequency}} param already used for batch
> file loads.
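
To illustrate why the fixed 0.2 second window hurts low-throughput pipelines, here is a rough, standard-library-only simulation of size-or-duration batching. It is not Beam's implementation: the function {{group_into_batches}}, its parameters, and the one-row-per-second workload are all made up for this sketch, and it assumes a batch is flushed either when it reaches {{batch_size}} or when its buffering window has expired.

```python
from typing import Iterable, List, Tuple


def group_into_batches(
    events: Iterable[Tuple[float, str]],
    batch_size: int,
    max_buffering_secs: float,
) -> List[List[str]]:
    """Hypothetical simulation of size-or-duration batching.

    `events` are (arrival_time_secs, row) pairs sorted by arrival time.
    A batch is emitted when it holds `batch_size` rows, or when the next
    row arrives after the batch's buffering window has expired.
    """
    batches: List[List[str]] = []
    current: List[str] = []
    deadline = 0.0
    for arrival, row in events:
        # The window of the open batch expired before this row arrived.
        if current and arrival >= deadline:
            batches.append(current)
            current = []
        # The first row of a new batch starts a new buffering window.
        if not current:
            deadline = arrival + max_buffering_secs
        current.append(row)
        # Size trigger: flush as soon as the batch is full.
        if len(current) >= batch_size:
            batches.append(current)
            current = []
    # Flush whatever is left at the end of the input.
    if current:
        batches.append(current)
    return batches


# Assumed low-throughput workload: one row per second for ten seconds.
events = [(float(t), f"row-{t}") for t in range(10)]

short_window = group_into_batches(events, batch_size=500, max_buffering_secs=0.2)
long_window = group_into_batches(events, batch_size=500, max_buffering_secs=5.0)

print(len(short_window), len(long_window))  # prints "10 2"
```

With the 0.2 second window every row ends up in its own batch, so {{batch_size}} never comes into play; with a 5 second window the same ten rows go out as two batches of five.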