nealrichardson commented on issue #43627: URL: https://github.com/apache/arrow/issues/43627#issuecomment-2282191866
I think there's something off in the C++ library that's responsible for this regression. I ran `profvis` on this query on the CRAN 16.1.0 version and a dev build, and the hotspot is `RecordBatchReader$read_table` called inside `collect()`, i.e. Acero is slower in evaluating the query and returning the batches. It does not seem to be anything in the R layer. I'll try bisecting history to find out which commit introduces this. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
