Ted-Jiang opened a new issue, #2358:
URL: https://github.com/apache/arrow-rs/issues/2358

   The skip records API added to the ArrayReader trait as part of 
https://github.com/apache/arrow-rs/pull/1998 does not provide a way to combine 
multiple selections into the same batch. This is unfortunate as columnar query 
engines will often want consistently large RecordBatch so that any dispatch 
overheads can be amortised over many rows. Whilst it could concatenate batches 
together, e.g. DataFusion's CoalesceBatchesExec, it would be more efficient to 
do this directly on read and eliminate an additional copy.
   
   _Originally posted by @tustvold in 
https://github.com/apache/arrow-rs/issues/2197#issuecomment-1197934926_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to