[
https://issues.apache.org/jira/browse/ARROW-1213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092560#comment-16092560
]
Wes McKinney commented on ARROW-1213:
-------------------------------------
I started working up a patch and need to do a bit of refactoring to make this
simpler, so I took this off the 0.5.0 release so it doesn't hold up the release
candidate. I figure this will take more than a day of testing to be suitably
hardened for production, so let's work together so we can have this working
robustly in 0.6.0. As soon as it's working we can set up conda builds to make
deployment and testing easier until the 0.6.0 release goes out (hopefully
within 4-5 weeks after 0.5.0).
> [Python] Enable s3fs to be used with ParquetDataset and reader/writer
> functions
> -------------------------------------------------------------------------------
>
> Key: ARROW-1213
> URL: https://issues.apache.org/jira/browse/ARROW-1213
> Project: Apache Arrow
> Issue Type: Improvement
> Reporter: Yacko
> Assignee: Wes McKinney
> Priority: Minor
> Fix For: 0.6.0
>
>
> Pyarrow dataset function can't read from s3 using s3fs as the filesystem. Is
> there a way we can add the support for read from s3 based on partitioned
> files ?
> I am trying to address the problem mentioned in the stackoverflow link :
> https://stackoverflow.com/questions/45082832/how-to-read-partitioned-parquet-files-from-s3-using-pyarrow-in-python
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)