[ 
https://issues.apache.org/jira/browse/AIRFLOW-3205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16649995#comment-16649995
 ] 

Gordon Ball commented on AIRFLOW-3205:
--------------------------------------

The MySQL->GCS operator splits its output into multiple files, whereas this 
issue is about uploading a single logical file in multiple HTTP requests to 
avoid a size limit.

The former behaviour is useful in its own right (e.g., for import to BigQuery, 
the multiple uploaded files can be imported in parallel instead of as a slow 
serial import of a single file), but it is orthogonal to this case.
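
To make the distinction concrete, here is a rough sketch of how one logical 
file could be cut into byte ranges, one per HTTP request, so that no single 
request approaches the 2GiB limit. It is only an illustration: the names 
chunk_ranges and CHUNK_MULTIPLE are hypothetical and not part of 
GoogleCloudStorageHook, and the code only computes the Content-Range values; 
it does not perform any upload. It assumes the GCS resumable upload rule that 
every chunk except the last is a multiple of 256 KiB.

```python
from typing import Iterator, Tuple

# Assumption: GCS resumable uploads want every chunk except the last one
# to be a multiple of 256 KiB.
CHUNK_MULTIPLE = 256 * 1024


def chunk_ranges(total_size: int, chunk_size: int) -> Iterator[Tuple[int, int, str]]:
    """Yield (start, end_inclusive, content_range) for each upload request.

    Hypothetical helper: it only plans the requests for one logical file.
    """
    if total_size <= 0:
        raise ValueError("total_size must be positive")
    if chunk_size <= 0 or chunk_size % CHUNK_MULTIPLE != 0:
        raise ValueError("chunk_size must be a positive multiple of 256 KiB")

    start = 0
    while start < total_size:
        # The last chunk may be shorter than chunk_size.
        end = min(start + chunk_size, total_size) - 1
        yield start, end, f"bytes {start}-{end}/{total_size}"
        start = end + 1


if __name__ == "__main__":
    # A 3 GiB file planned as 256 MiB requests.
    for start, end, header in chunk_ranges(3 * 1024**3, 256 * 1024**2):
        print(header)
```

All of the ranges belong to the same destination object, which is what makes 
this different from the operator's behaviour of writing several separate 
files.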

> GCS: Support multipart upload
> -----------------------------
>
>                 Key: AIRFLOW-3205
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-3205
>             Project: Apache Airflow
>          Issue Type: Improvement
>          Components: gcp
>            Reporter: Gordon Ball
>            Priority: Minor
>
> GoogleCloudStorageHook currently only provides support for uploading files in 
> a single HTTP request. This means that uploads fail with SSL errors for files 
> larger than 2GiB (presumably an int32 overflow; this might depend on which SSL 
> library is being used). Multipart uploads should be supported to allow large 
> uploads, and possibly to increase reliability for smaller uploads.



