adriangb opened a new pull request, #19932:
URL: https://github.com/apache/datafusion/pull/19932

   ## Summary
   - Adds `hash_join_map_pushdown` config option to control hash table 
reference pushdown
   - Adds `hash_join_bounds_pushdown` config option to control min/max bounds 
pushdown
   
   ## Details
   
   This PR adds two new configuration options to independently control hash 
join dynamic filter pushdown behavior:
   
   - `hash_join_map_pushdown` (default: true): Controls whether to push down 
hash table references for membership checks when InList thresholds are 
exceeded. When false, no membership filter is created if the build side is too 
large for InList pushdown.
   
   - `hash_join_bounds_pushdown` (default: true): Controls whether to push down 
min/max bounds for join key columns. When false, only membership filters 
(InList or Map) are pushed down.
   
   This enables flexible combinations:
   - InList only (no bounds, no map)
   - Bounds only (no inlist, no map)  
   - Map only (no bounds)
   - InList + bounds (no map)
   - Any combination thereof
   
   The primary use case is allowing users to use InList + bounds without the 
map fallback, which can be beneficial for certain workloads where the hash 
table reference overhead outweighs its benefits.
   
   ## Test plan
   - [x] Existing hash_join tests pass (782 tests)
   - [ ] CI validation
   
   🤖 Generated with [Claude Code](https://claude.com/claude-code)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to