qyw created FLINK-35150:
---------------------------
Summary: The specified upload does not exist. The upload ID may be
invalid
Key: FLINK-35150
URL: https://issues.apache.org/jira/browse/FLINK-35150
Project: Flink
Issue Type: Bug
Components: Connectors / FileSystem
Affects Versions: 1.15.0
Reporter: qyw
Attachments: image-2024-04-18-10-51-05-071.png,
image-2024-04-18-11-03-08-998.png, image-2024-04-18-11-07-15-555.png
Flink S3 hadoop, write S3 in csv mode, I used this patch
[FLINK-28513|https://issues.apache.org/jira/browse/FLINK-28513] . But I don't
understand why S3RecoverableFsDataOutputStream "sync" method of this class to
be "completeMultipartUpload" operation, if "completeMultipartUpload" here,
Calling close later to upload the rest of the stream will inevitably result in
an error. The part corresponding to uploadID has been merged.
Therefore, when the message in csv is larger than "S3_MULTIPART_MIN_PART_SIZE",
the uploadPart will be started when switching files, then when BulkPartWriter
performs closeForCommit, Due to the sync S3RecoverableFsDataOutputStream method
call completeMultipartUpload, So S3RecoverableFsDataOutputStream
"closeForCommit" method due to the uploadPart, at this time will lead to errors.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)