adriangb commented on issue #17171: URL: https://github.com/apache/datafusion/issues/17171#issuecomment-3283565041
One note about a cool thing we could do with bloom filters: in theory you can compute an intersection in `O(size of the bloom filter)`. If we push down a bloom filter for col `a` and the Parquet file has a row group bloom filter on that column `a` we can do very fast and accurate (better than min/max although still lossy) pruning of row groups without reading the actual data. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org