[ 
https://issues.apache.org/jira/browse/ARROW-9538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17631653#comment-17631653
 ] 

Joris Van den Bossche commented on ARROW-9538:
----------------------------------------------

This already works in {{pyarrow.fs}}, so any function that uses it under the 
hood, such as {{pq.read_table(..)}} or {{ds.dataset(..)}}, should also accept 
an S3 URI.

{code}
In [1]: import pyarrow.fs

In [2]: pyarrow.fs._resolve_filesystem_and_path("s3://bucket/folder/file.ext")
Out[2]: (<pyarrow._s3fs.S3FileSystem at 0x7f638c707db0>, 'bucket/folder/file.ext')
{code}
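
To make the mapping in the output above concrete, here is a minimal 
stdlib-only sketch of how an S3 URI splits into a scheme and a 
"bucket/key" path. The helper name {{split_s3_uri}} is hypothetical and this 
is not pyarrow's implementation; it only illustrates the shape of the path 
string returned next to the filesystem object.

```python
from urllib.parse import urlsplit


def split_s3_uri(uri):
    """Illustrative helper (not part of pyarrow): split an S3 URI.

    Returns a (scheme, path) tuple where path is "bucket/key", the same
    shape as the path string in the resolver output shown above.
    """
    parts = urlsplit(uri)
    if parts.scheme != "s3":
        raise ValueError(f"not an S3 URI: {uri!r}")
    # For "s3://bucket/folder/file.ext", netloc is the bucket name and
    # path is the object key with a leading slash.
    return parts.scheme, parts.netloc + parts.path


scheme, path = split_s3_uri("s3://bucket/folder/file.ext")
print(scheme, path)
```

In real code the resolution should be left to pyarrow itself. As far as I 
know, the public counterpart of the private helper above is 
{{pyarrow.fs.FileSystem.from_uri}}, which likewise returns a filesystem and a 
path.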

Since the {{pyarrow.filesystem}} module is deprecated and no longer being 
developed, I am going to close this issue.

> [Python] Allow pyarrow.filesystem.resolve_filesystem_and_path to parse S3 URL
> -----------------------------------------------------------------------------
>
>                 Key: ARROW-9538
>                 URL: https://issues.apache.org/jira/browse/ARROW-9538
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>    Affects Versions: 0.17.1
>            Reporter: Adam Avilla
>            Priority: Minor
>              Labels: filesystem
>
> {{pyarrow.filesystem.resolve_filesystem_and_path}} should support a {{where}} 
> that is an S3 URL like:
> {code:java}
> s3://bucket/folder/file.ext{code}
> It seems like all the pieces are there but this was never developed. If given 
> some light guidance I may be able to add the code in a PR.
> Thanks and LMK if this is a crazy request!



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
