lidavidm opened a new pull request #9613: URL: https://github.com/apache/arrow/pull/9613
Some applications want to pipeline I/O and Parquet decoding: rather than blocking on I/O, they decode data as it becomes available. This change lets them pre-buffer the data they're about to read, then be explicitly signaled when the buffering completes.

Once we also make the Parquet reader re-entrant, we can expose this under a more ergonomic API as `Future<AsyncGenerator<shared_ptr<RecordBatch>>> parquet::arrow::FileReader::ReadRowGroupsAsync(...)`.
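As a rough illustration of the pre-buffer-then-wait pattern described above, here is a minimal sketch that uses only the C++ standard library. It is not the Arrow or Parquet API: `Buffer`, `FetchRowGroup`, `DecodeRowGroup`, and `ReadRowGroupsPipelined` are hypothetical names, and `std::future` stands in for whatever completion signal the reader actually exposes.

```cpp
#include <chrono>
#include <future>
#include <numeric>
#include <thread>
#include <vector>

// Hypothetical stand-in for the raw bytes of one row group.
using Buffer = std::vector<int>;

// Hypothetical I/O step: pretend to fetch row group `index` from storage.
Buffer FetchRowGroup(int index) {
  // Simulated I/O latency.
  std::this_thread::sleep_for(std::chrono::milliseconds(10));
  // Four values, each equal to the row group index.
  return Buffer(4, index);
}

// Hypothetical decode step: reduce a buffered row group to the sum of its values.
int DecodeRowGroup(const Buffer& buffer) {
  return std::accumulate(buffer.begin(), buffer.end(), 0);
}

// Start every read up front (the "pre-buffer" step), then decode the row
// groups in order, waiting only on the one that is needed next. While one
// row group is being decoded, the reads for later ones keep running.
std::vector<int> ReadRowGroupsPipelined(const std::vector<int>& row_groups) {
  std::vector<std::future<Buffer>> pending;
  pending.reserve(row_groups.size());
  for (int index : row_groups) {
    pending.push_back(std::async(std::launch::async, FetchRowGroup, index));
  }

  std::vector<int> decoded;
  decoded.reserve(row_groups.size());
  for (auto& buffered : pending) {
    // get() returns once this row group's buffering has completed.
    decoded.push_back(DecodeRowGroup(buffered.get()));
  }
  return decoded;
}
```

Calling `ReadRowGroupsPipelined({0, 1, 2})` issues all three simulated reads before any decoding starts, so the caller waits roughly as long as the slowest read rather than the sum of all reads.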
