Re: [I] [R] Add support for `%in%` within partitions [arrow]

via GitHub Wed, 02 Jul 2025 11:48:12 -0700


JerePlum99 commented on issue #46681:
URL: https://github.com/apache/arrow/issues/46681#issuecomment-3028985233


   @thisisnic I'm on version `15.0.1` of the R package, so it's probably not 
that bug. The issue is specifically performance: I noticed that on large S3 
datasets a simple `==` filter that leverages partitions is very quick, while on 
the same dataset a `%in%` filter that selects 2 of the categories seems to lose 
that partitioning speedup. 
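
   For reference, a minimal sketch of the two filter shapes being compared. It 
assumes a small local dataset in a temp directory as a stand-in for the S3 one; 
the column names `category` and `value` and the data are hypothetical, and at 
this size the timings only show how to measure, not the slowdown itself. 

```r
library(arrow)
library(dplyr)

# Hypothetical stand-in data: 3 partition values, 1000 rows each.
df <- data.frame(
  category = rep(c("a", "b", "c"), each = 1000),
  value = seq_len(3000)
)

# Write a hive-partitioned dataset (category=a/, category=b/, category=c/).
path <- tempfile()
write_dataset(df, path, partitioning = "category")

ds <- open_dataset(path)

# Equality filter on the partition column.
eq_time <- system.time(
  eq_result <- ds %>% filter(category == "a") %>% collect()
)

# %in% filter selecting 2 of the partition values.
in_time <- system.time(
  in_result <- ds %>% filter(category %in% c("a", "b")) %>% collect()
)

print(eq_time)
print(in_time)
```

   On the real dataset, `path` would be the S3 URI instead of a temp directory. 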
   
   I can try upgrading the package and testing whether that's relevant. I 
forget why, but I had pinned this version because of some other instability at 
the time. I work a lot with `duckdb` and `duckplyr` as well. 
   
   I'll test a bit more next week and see if I can narrow it down.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
