jorisvandenbossche commented on pull request #10991: URL: https://github.com/apache/arrow/pull/10991#issuecomment-904871131
This is very much work-in-progress for now. It's a basic implementation of the sync ScanFile, while I should take a look at ScanBatchesAsync instead, it still needs to support ORC options to read specific columns only (for projection pushdown), to set the batch size, no writing support yet, still need to add C++ tests and more python tests, etc. But the basics work: you can open and read an ORC dataset. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
