[
https://issues.apache.org/jira/browse/ARROW-5131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16913741#comment-16913741
]
Wes McKinney commented on ARROW-5131:
-------------------------------------
Our medium/long-term plan in Apache Arrow is to support cloud filesystems in
C++. See initial steps in this direction to support Amazon S3
https://github.com/apache/arrow/pull/5167
> [Python] Add Azure Datalake Filesystem Gen1 Wrapper for pyarrow
> ---------------------------------------------------------------
>
> Key: ARROW-5131
> URL: https://issues.apache.org/jira/browse/ARROW-5131
> Project: Apache Arrow
> Issue Type: Wish
> Components: Python
> Affects Versions: 0.12.1
> Reporter: Gregory Hayes
> Priority: Minor
> Labels: pull-request-available
> Time Spent: 5.5h
> Remaining Estimate: 0h
>
> The current pyarrow package can only read parquet files that have been
> written to Gen1 Azure Datalake using the fastparquet engine. This only works
> if the dask-adlfs package is explicitly installed and imported. I've added a
> method to the dask-adlfs package, found
> [here|https://github.com/dask/dask-adlfs], and issued a PR for that change.
> To support this capability, added an ADLFSWrapper to filesystem.py file.
--
This message was sent by Atlassian Jira
(v8.3.2#803003)