Lordworms commented on code in PR #13054:
URL: https://github.com/apache/datafusion/pull/13054#discussion_r1833251081


##########
datafusion/physical-plan/src/joins/hash_join.rs:
##########
@@ -1406,12 +1403,24 @@ impl HashJoinStream {
                 self.hashes_buffer.resize(batch.num_rows(), 0);
                 create_hashes(&keys_values, &self.random_state, &mut 
self.hashes_buffer)?;
 
+                let (filtered_batch, filtered_hashes) =
+                    if let Some(dynamic_filter) = &self.dynamic_filter_info {
+                        dynamic_filter.filter_probe_batch(

Review Comment:
   > Hmm, `filter_probe_batch` shouldn't be added as filter on the `hash_join` 
but rather as `PhysicalExpr` (returning boolean) to filter the `ParquetExec`
   
   Sure, I'll refactor that, other than that I found a problem with ParquetExec 
here https://github.com/apache/datafusion/issues/13298. If the pushdown cost is 
much more than do it directly. I am not sure whether it is a 'optimization' or 
not....



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to