[
https://issues.apache.org/jira/browse/ARROW-3781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16685809#comment-16685809
]
Wes McKinney commented on ARROW-3781:
-------------------------------------
Sorry, I'm using file systems here again proverbially. TensorFlow and other
projects call their integrations with other file storage systems "file
systems", e.g.
https://github.com/tensorflow/tensorflow/blob/master/tensorflow/core/platform/s3/s3_file_system.h#L25
I am not sure a Write or Flush into S3 is necessarily going to be asynchronous.
The implementation in TensorFlow of Flush blocks until the PutRequest is
completed
https://github.com/tensorflow/tensorflow/blob/master/tensorflow/core/platform/s3/s3_file_system.cc#L238
> [C++] Configure buffer size in arrow::io::BufferedOutputStream
> --------------------------------------------------------------
>
> Key: ARROW-3781
> URL: https://issues.apache.org/jira/browse/ARROW-3781
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Wes McKinney
> Priority: Major
> Fix For: 0.12.0
>
>
> This is hard-coded to 4096 right now. For higher latency file systems it may
> be desirable to use a larger buffer. See also ARROW-3777 about performance
> testing for high latency files
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)