Kenneth Jung created BEAM-11226:
-----------------------------------

             Summary: Support larger tables in BigQuery storage API bounded source
                 Key: BEAM-11226
                 URL: https://issues.apache.org/jira/browse/BEAM-11226
             Project: Beam
          Issue Type: Improvement
          Components: io-java-gcp
    Affects Versions: 2.25.0
            Reporter: Kenneth Jung


The BigQuery storage API client specifies a default deadline of 10 minutes for 
calls to 
[CreateReadSession](https://cloud.google.com/bigquery/docs/reference/storage#create_a_session).
For large tables, however, this operation can take longer than 10 minutes, so 
read session creation fails for these tables. Some Beam runners allow much 
longer for initial split calls on bounded sources; Google Cloud Dataflow, for 
example, allows 2 hours. We can support larger BigQuery tables with the storage 
API source by raising the client-side deadline for CreateReadSession to match 
this 2-hour limit.
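
To illustrate the failure mode and the proposed fix, here is a minimal,
standard-library-only sketch. It does not use the BigQuery storage client:
`createReadSession` is a hypothetical stand-in that completes after a
configurable delay, and durations are scaled down so that 100 ms represents the
10-minute default and 1200 ms represents the 2-hour limit. In the real client,
the deadline would be changed in the client's call settings for
CreateReadSession rather than passed as an argument.

```java
import java.time.Duration;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class ReadSessionDeadlineSketch {

  // Hypothetical stand-in for CreateReadSession: the "server" needs
  // `serverTime` before it can return a session.
  static CompletableFuture<String> createReadSession(Duration serverTime) {
    return CompletableFuture.supplyAsync(
        () -> "session",
        CompletableFuture.delayedExecutor(serverTime.toMillis(), TimeUnit.MILLISECONDS));
  }

  // Returns true if the call completes before the client-side deadline,
  // false if the client gives up first.
  static boolean succeedsWithin(Duration serverTime, Duration deadline)
      throws InterruptedException {
    try {
      createReadSession(serverTime).get(deadline.toMillis(), TimeUnit.MILLISECONDS);
      return true;
    } catch (TimeoutException e) {
      return false;
    } catch (ExecutionException e) {
      throw new IllegalStateException(e.getCause());
    }
  }

  public static void main(String[] args) throws InterruptedException {
    Duration largeTable = Duration.ofMillis(300);        // scaled: a 30-minute operation
    Duration defaultDeadline = Duration.ofMillis(100);   // scaled: 10-minute default
    Duration raisedDeadline = Duration.ofMillis(1200);   // scaled: 2-hour split limit

    System.out.println("default deadline: " + succeedsWithin(largeTable, defaultDeadline));
    System.out.println("raised deadline:  " + succeedsWithin(largeTable, raisedDeadline));
  }
}
```

With the default deadline, the client abandons the call even though the
operation would eventually finish. With the raised deadline, the same call
succeeds, and it still fits inside the runner's split allowance.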



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
