adriangb commented on issue #17171:
URL: https://github.com/apache/datafusion/issues/17171#issuecomment-3283565041

   One note about a cool thing we could do with bloom filters: in theory you 
can compute an intersection in `O(size of the bloom filter)`. If we push down a 
bloom filter for col `a` and the Parquet file has a row group bloom filter on 
that column `a` we can do very fast and accurate (better than min/max although 
still lossy) pruning of row groups without reading the actual data.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to