adriangb opened a new pull request, #19932: URL: https://github.com/apache/datafusion/pull/19932
## Summary - Adds `hash_join_map_pushdown` config option to control hash table reference pushdown - Adds `hash_join_bounds_pushdown` config option to control min/max bounds pushdown ## Details This PR adds two new configuration options to independently control hash join dynamic filter pushdown behavior: - `hash_join_map_pushdown` (default: true): Controls whether to push down hash table references for membership checks when InList thresholds are exceeded. When false, no membership filter is created if the build side is too large for InList pushdown. - `hash_join_bounds_pushdown` (default: true): Controls whether to push down min/max bounds for join key columns. When false, only membership filters (InList or Map) are pushed down. This enables flexible combinations: - InList only (no bounds, no map) - Bounds only (no inlist, no map) - Map only (no bounds) - InList + bounds (no map) - Any combination thereof The primary use case is allowing users to use InList + bounds without the map fallback, which can be beneficial for certain workloads where the hash table reference overhead outweighs its benefits. ## Test plan - [x] Existing hash_join tests pass (782 tests) - [ ] CI validation 🤖 Generated with [Claude Code](https://claude.com/claude-code) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
