[
https://issues.apache.org/jira/browse/ARROW-13795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17406885#comment-17406885
]
Weston Pace commented on ARROW-13795:
-------------------------------------
We could take a background reader approach like we do with CSV. Does the ORC
C++ API use threads under the hood or is it thread safe?
> [C++] Add async version of the ORC Dataset scanner
> --------------------------------------------------
>
> Key: ARROW-13795
> URL: https://issues.apache.org/jira/browse/ARROW-13795
> Project: Apache Arrow
> Issue Type: Sub-task
> Components: C++
> Reporter: Joris Van den Bossche
> Priority: Major
> Labels: dataset, orc
>
> ARROW-13572 (https://github.com/apache/arrow/pull/10991) added basic support
> for ORC file format in the Datasets API, but for now only implemented the
> sync {{OrcFileFormat::ScanFile}}, while we should rather implemented
> {{OrcFileFormat::ScanBatchesAsync}} instead.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)