[ 
https://issues.apache.org/jira/browse/BEAM-12865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Quentin Sommer updated BEAM-12865:
----------------------------------
    Description: 
Hi,

We allow customising the {{batch_size}} when streaming to BigQuery, but the 
batch duration (used by {{GroupIntoBatches}}) is fixed at 
{{DEFAULT_BATCH_BUFFERING_DURATION_LIMIT_SEC}} (0.2 seconds).

I'd like to add an option to specify the batch duration, to allow better 
batching in scenarios with low data throughput.
It will reuse the {{triggering_frequency}} param already used for batch 
file loads.
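
To illustrate why the duration limit matters at low throughput, here is a rough sketch in plain Python. It is a simplified model and not Beam's actual {{GroupIntoBatches}} implementation: the function name, the event list and the flush rule (flush when the batch is full, or when a row arrives after the deadline set by the first buffered row) are all assumptions made for this example.

```python
from typing import Iterable, List, Tuple


def batch_by_size_or_duration(
    events: Iterable[Tuple[float, str]],
    batch_size: int,
    max_buffering_duration_sec: float,
) -> List[List[str]]:
    """Group (arrival_time_sec, row) pairs into batches.

    Hypothetical helper: a batch is flushed when it holds batch_size rows,
    or when the next row arrives at or after the deadline that was set
    when the first row of the batch was buffered.
    """
    batches: List[List[str]] = []
    current: List[str] = []
    deadline = 0.0
    for arrival, row in events:
        # The buffering deadline of the open batch has passed: flush it.
        if current and arrival >= deadline:
            batches.append(current)
            current = []
        # The first row of a new batch starts the buffering clock.
        if not current:
            deadline = arrival + max_buffering_duration_sec
        current.append(row)
        # Size limit reached: flush without waiting for the deadline.
        if len(current) >= batch_size:
            batches.append(current)
            current = []
    if current:
        batches.append(current)
    return batches


# Assumed low-throughput stream: one row every 0.5 s, 20 rows in total.
events = [(i * 0.5, f"row-{i}") for i in range(20)]

short_batches = batch_by_size_or_duration(
    events, batch_size=500, max_buffering_duration_sec=0.2)
long_batches = batch_by_size_or_duration(
    events, batch_size=500, max_buffering_duration_sec=5.0)

print(len(short_batches), len(long_batches))
```

In this model, the 0.2 second limit puts every row in its own batch regardless of {{batch_size}}, whereas a 5 second limit groups the same rows ten at a time, which is the effect the new option is meant to allow.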

  was:
Hi,

We allow customising the {{batch_size}} when streaming to BigQuery but the 
batch duration (used by {{GroupIntoBatches}}) is set to 
{{DEFAULT_BATCH_BUFFERING_DURATION_LIMIT_SEC}} (0.2)

I'd like to add the option to specify the {{batch_duration}} to allow better 
batching for scenarios with little data throughput.


> Allow customising batch duration when streaming with WriteToBigQuery
> --------------------------------------------------------------------
>
>                 Key: BEAM-12865
>                 URL: https://issues.apache.org/jira/browse/BEAM-12865
>             Project: Beam
>          Issue Type: New Feature
>          Components: io-py-gcp
>    Affects Versions: Not applicable
>            Reporter: Quentin Sommer
>            Priority: P2
>              Labels: stale-P2
>             Fix For: Not applicable
>
>          Time Spent: 8h 50m
>  Remaining Estimate: 0h
>
> Hi,
> We allow customising the {{batch_size}} when streaming to BigQuery, but the 
> batch duration (used by {{GroupIntoBatches}}) is fixed at 
> {{DEFAULT_BATCH_BUFFERING_DURATION_LIMIT_SEC}} (0.2 seconds).
> I'd like to add an option to specify the batch duration, to allow better 
> batching in scenarios with low data throughput.
> It will reuse the {{triggering_frequency}} param already used for batch 
> file loads.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
