adriangb opened a new pull request, #18436:
URL: https://github.com/apache/datafusion/pull/18436

   ## Summary
   
   Pure refactoring of hash join infrastructure to improve code organization 
and prepare for future filter pushdown optimizations. **No behavioral changes.**
   
   ## Changes
   
   - Added `SharedBuildAccumulator` for coordinating build-side state
   - Added `ColumnBounds` and `PartitionBounds` structures
   - Added `inlist_builder.rs` module for InList construction utilities
   - Added `partitioned_hash_eval.rs` module with hash expression infrastructure
   - Changed hash maps from `Box<dyn JoinHashMapType>` to `Arc<dyn 
JoinHashMapType>` for future sharing
   - All new code marked with `#[allow(dead_code)]` as it's infrastructure for 
future PRs
   
   ## Value
   
   - Cleaner hash join architecture
   - Easier to maintain and extend
   - Minimizes future diff when adding features
   
   ## Testing
   
   - ✅ All clippy checks pass with `-D warnings`
   - ✅ 583 hash join tests passing
   - ✅ No behavioral changes (all existing tests pass)
   
   ## Part of Multi-PR Strategy
   
   This is **PR4 of 7** in the InList pushdown feature breakdown.
   
   **Tier**: Capabilities (Tier 2)
   **Dependencies**: None (independent refactoring)
   **Can merge**: As soon as approved


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to