RinChanNOWWW opened a new issue, #4864: URL: https://github.com/apache/arrow-rs/issues/4864
Both in `arrow_reader` and `async_reader`, if there are predicates, the reader will first read arrays (wrapped in `RecordBatch`) to evaluate the predicates and get a row selection. And then the reader will use the row selection to output the final needed arrays. If some arrays in the final output are contained in the prefetched arrays, we also have to deserialize them again. This is quite wasteful. We should have a reasonable way to reuse the arrays. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
