adriangb commented on code in PR #16424:
URL: https://github.com/apache/datafusion/pull/16424#discussion_r2161929117


##########
datafusion/core/tests/parquet/row_group_pruning.rs:
##########
@@ -421,7 +421,7 @@ macro_rules! int_tests {
                     .with_query(&format!("SELECT * FROM t where i{} in (100)", 
$bits))
                     .with_expected_errors(Some(0))
                     .with_matched_by_stats(Some(0))
-                    .with_pruned_by_stats(Some(0))
+                    .with_pruned_by_stats(Some(4))

Review Comment:
   This is basically reverting the changes made in 
https://github.com/apache/datafusion/pull/16447 because we are now not doing 
file level pruning if there are only static filters.
   
   @alamb maybe this is an argument to still do the file level pruning even if 
there are no dynamic filters. I guess this pruning is _not_ happening anywhere 
else. One could argue it should happen right after statistics are collected, in 
the TableProvider, but that doesn't seem to be the case and maybe it's better 
to just let it happen in `ParquetOpener` (or we could move this pruning up to 
the `DataSourceExec` or something so it works for all sources) so that 
TableProvider complexity is kept to a minimum.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to