[jira] [Commented] (ARROW-1213) [Python] Enable s3fs to be used with ParquetDataset and reader/writer functions

Wes McKinney (JIRA) Tue, 18 Jul 2017 21:43:46 -0700

    [ 
https://issues.apache.org/jira/browse/ARROW-1213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092560#comment-16092560
 ]


Wes McKinney commented on ARROW-1213:
-------------------------------------

I started working up a patch and need to do a bit of refactoring to make this 
simpler, so I took this off the 0.5.0 release so it doesn't hold up the release 
candidate. I figure this will take more than a day of testing to be suitably 
hardened for production, so let's work together so we can have this working 
robustly in 0.6.0. As soon as it's working we can set up conda builds to make 
deployment and testing easier until the 0.6.0 release goes out (hopefully 
within 4-5 weeks after 0.5.0).

> [Python] Enable s3fs to be used with ParquetDataset and reader/writer 
> functions
> -------------------------------------------------------------------------------
>
>                 Key: ARROW-1213
>                 URL: https://issues.apache.org/jira/browse/ARROW-1213
>             Project: Apache Arrow
>          Issue Type: Improvement
>            Reporter: Yacko
>            Assignee: Wes McKinney
>            Priority: Minor
>             Fix For: 0.6.0
>
>
> Pyarrow dataset function can't read from s3 using s3fs as the filesystem. Is  
> there a way we can add the support for read from s3 based on partitioned 
> files ?
> I am trying to address the problem mentioned in the stackoverflow link :
> https://stackoverflow.com/questions/45082832/how-to-read-partitioned-parquet-files-from-s3-using-pyarrow-in-python



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

[jira] [Commented] (ARROW-1213) [Python] Enable s3fs to be used with ParquetDataset and reader/writer functions

Reply via email to