wjones127 commented on PR #13681:
URL: https://github.com/apache/arrow/pull/13681#issuecomment-1191859530

   Based on my experience with the R docs, here a few specific things we should 
note:
   
    * The default value for `retry_time_limit` is 15 minutes, but users may 
often wish to lower this to more like 15 seconds. This is especially important 
to note because of https://issues.apache.org/jira/browse/ARROW-17020
    * To connect to public buckets, you *must* pass `anonymous=True`. See 
https://issues.apache.org/jira/browse/ARROW-17069
    * Unlike `S3FileSystem`, `GcsFileSystem` will only return directories in 
`GetFileInfo` if there are special directory markers created by Arrow. 
Basically this means that if a directory structure was created by some 
mechanism other than a filesystem (as is the case in our 
`voltrondata-labs-datasets` bucket), the "directories" will be invisible. Users 
are therefore encouraged to always list with `recursive=True` in 
`GcsFileSystem`. See https://issues.apache.org/jira/browse/ARROW-17020


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to