JerePlum99 commented on issue #46681: URL: https://github.com/apache/arrow/issues/46681#issuecomment-3028985233
@thisisnic I'm on the R `v15.0.1` version, so probably not that bug. It is specifically the performance speed - I noticed that in large S3 datasets a simple == filter that leverages partitions is very quick, while on the same dataset a `%in%` filter that opens 2 of the categories seems to lose the partitioning speed. I can try bumping up the package and testing if that's relevant, I forget why but I had pegged to this version because of some other instability at the time. I work a lot with `duckdb` and `duckplyr` as well. Will test out a bit more next week and see if I can narrow in on it -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org