LiaCastaneda opened a new issue, #17527: URL: https://github.com/apache/datafusion/issues/17527
### Describe the bug Just a follow up to [this](https://github.com/apache/datafusion/issues/17486#issuecomment-3271707155) comment. Currently, DataFusion computes bounds for all queries that contain a HashJoinExec node whenever the option enable_dynamic_filter_pushdown is set to true (default). It might make sense to compute these bounds only when we explicitly know there is a consumer that will use them. One way to achieve this could be during physical planning: while traversing the plan, check whether there is any scan/leaf node that is “interested in” or supports dynamic filters (determined by `gather_filters_for_pushdown`). This might just require adding some logic to the filter pushdown optimization rule itself I think? Then, only if there is at least one interested consumer, set a flag on HashJoinExec to build the bounds accumulator, otherwise, skip bounds computation entirely. ### To Reproduce _No response_ ### Expected behavior _No response_ ### Additional context _No response_ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
