[
https://issues.apache.org/jira/browse/FLINK-20918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17268505#comment-17268505
]
Yun Gao commented on FLINK-20918:
---------------------------------
Hi [~Paul Lin], very thanks for opening the issue! One concern to me is that
could we ensure that in all implementations we haveĀ `hsync` is an enhanced
version of `hflush` ? I'm ask so since I think there might be some other
FileSystem or Object Store provide hadoop compatible FileSystems, thus is it
possible that the change might cause different behaviors for some users ?
> Avoid excessive flush of Hadoop output stream
> ---------------------------------------------
>
> Key: FLINK-20918
> URL: https://issues.apache.org/jira/browse/FLINK-20918
> Project: Flink
> Issue Type: Bug
> Components: Connectors / Hadoop Compatibility, FileSystems
> Affects Versions: 1.12.0, 1.11.3
> Reporter: Paul Lin
> Priority: Major
> Labels: pull-request-available
>
> [HadoopRecoverableFsDataOutputStream#sync|https://github.com/apache/flink/blob/67d167ccd45046fc5ed222ac1f1e3ba5e6ec434b/flink-filesystems/flink-hadoop-fs/src/main/java/org/apache/flink/runtime/fs/hdfs/HadoopRecoverableFsDataOutputStream.java#L123]
> calls both `hflush` and `hsync`, whereas `hsync` is an enhanced version of
> `hflush`. We should remove the `hflush` call to avoid the excessive flush.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)