alamb commented on PR #8732:
URL: 
https://github.com/apache/arrow-datafusion/pull/8732#issuecomment-1875987861

   > May it is better to add some integration test? But I don't know how check 
whether bloomfilter works with current code. I check it by the metric I add 
myself(like 
https://github.com/apache/arrow-datafusion/compare/main...my-vegetable-has-exploded:arrow-datafusion:metric-sbbf?expand=1)
 , Should I add it to this branch?
   
   I think an integration test as well as the bloom filter metrics would be 
good.
   
   Here are my recommended follow on steps
   1. One PR to add the new metrics to distinguish filtering on bloom filters 
vs statistics
   2. One PR with some integration tests to verify bloom filters are actually 
pruning (maybe following how it is done in 
https://github.com/apache/arrow-datafusion/blob/1179a76567892b259c88f08243ee01f05c4c3d5c/datafusion/core/tests/parquet/row_group_pruning.rs#L42)
   
   Also, I am making a PR to clarify and improve comments on the 
PruningPredicate here
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to