[ 
https://issues.apache.org/jira/browse/FLINK-30049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jingsong Lee updated FLINK-30049:
---------------------------------
    Description: 
{code:java}
Caused by: org.apache.flink.util.SerializedThrowable: Cannot sync state to 
system like S3. Use persist() to create a persistent recoverable intermediate 
point.
        at 
org.apache.flink.core.fs.RefCountedBufferingFileStream.sync(RefCountedBufferingFileStream.java:111)
 
        at 
org.apache.flink.fs.s3.common.writer.S3RecoverableFsDataOutputStream.sync
        at 
org.apache.flink.formats.csv.CsvBulkWriter.finish(CsvBulkWriter.java:106) 
        at 
org.apache.flink.connector.file.table.FileSystemTableSink$ProjectionBulkFactory$1.finish(FileSystemTableSink.java:653)
 
        at 
org.apache.flink.streaming.api.functions.sink.filesystem.BulkPartWriter.closeForCommit(BulkPartWriter.java:64)
 
{code}

It looks like we should not call `sync` in CsvBulkWriter, we should just use 
`flush`.


  was:
{code:java}
Caused by: org.apache.flink.util.SerializedThrowable: Cannot sync state to 
system like S3. Use persist() to create a persistent recoverable intermediate 
point.
        at 
org.apache.flink.core.fs.RefCountedBufferingFileStream.sync(RefCountedBufferingFileStream.java:111)
 
        at 
org.apache.flink.fs.s3.common.writer.S3RecoverableFsDataOutputStream.sync
        at 
org.apache.flink.formats.csv.CsvBulkWriter.finish(CsvBulkWriter.java:106) 
        at 
org.apache.flink.connector.file.table.FileSystemTableSink$ProjectionBulkFactory$1.finish(FileSystemTableSink.java:653)
 
        at 
org.apache.flink.streaming.api.functions.sink.filesystem.BulkPartWriter.closeForCommit(BulkPartWriter.java:64)
 
{code}



> CsvBulkWriter is unsupported for S3 FileSystem in streaming sink
> ----------------------------------------------------------------
>
>                 Key: FLINK-30049
>                 URL: https://issues.apache.org/jira/browse/FLINK-30049
>             Project: Flink
>          Issue Type: Bug
>          Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
>    Affects Versions: 1.16.0, 1.15.2
>            Reporter: Jingsong Lee
>            Priority: Major
>
> {code:java}
> Caused by: org.apache.flink.util.SerializedThrowable: Cannot sync state to 
> system like S3. Use persist() to create a persistent recoverable intermediate 
> point.
>       at 
> org.apache.flink.core.fs.RefCountedBufferingFileStream.sync(RefCountedBufferingFileStream.java:111)
>  
>       at 
> org.apache.flink.fs.s3.common.writer.S3RecoverableFsDataOutputStream.sync
>       at 
> org.apache.flink.formats.csv.CsvBulkWriter.finish(CsvBulkWriter.java:106) 
>       at 
> org.apache.flink.connector.file.table.FileSystemTableSink$ProjectionBulkFactory$1.finish(FileSystemTableSink.java:653)
>  
>       at 
> org.apache.flink.streaming.api.functions.sink.filesystem.BulkPartWriter.closeForCommit(BulkPartWriter.java:64)
>  
> {code}
> It looks like we should not call `sync` in CsvBulkWriter, we should just use 
> `flush`.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to