[ https://issues.apache.org/jira/browse/ARROW-13572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joris Van den Bossche updated ARROW-13572:
------------------------------------------
Summary: [C++][Python] Add basic ORC support to the pyarrow.datasets API
(was: [Python][ORC] Add ORC support to the pyarrow.datasets API)
> [C++][Python] Add basic ORC support to the pyarrow.datasets API
> ---------------------------------------------------------------
>
> Key: ARROW-13572
> URL: https://issues.apache.org/jira/browse/ARROW-13572
> Project: Apache Arrow
> Issue Type: New Feature
> Components: Python
> Reporter: Rick Zamora
> Assignee: Joris Van den Bossche
> Priority: Major
> Labels: orc, pull-request-available
> Fix For: 6.0.0
>
> Time Spent: 1h 20m
> Remaining Estimate: 0h
>
> There is significant interest in having directory-partitioned ORC support
> from users of Dask. Since Dask already leverages the pyarrow.datasets API
> for Parquet-formatted data, having ORC support through the same pyarrow API
> would be extremely useful.