gene-bordegaray opened a new issue, #23183:
URL: https://github.com/apache/datafusion/issues/23183

   ### Is your feature request related to a problem or challenge?
   
   Part of #22395.
   
   DataFusion can represent source-declared `Partitioning::Range`, but inner 
partitioned hash joins still require hash-partitioned inputs. This can add 
unnecessary hash repartitions even when both join inputs are already 
partitioned by compatible ranges on the join keys.
   
   ### Describe the solution you'd like
   
   Allow inner partitioned hash joins to accept inputs that are co-partitioned 
on their join keys, including compatible range-partitioned inputs.
   
   This issue tracks a focused PR that:
   
   - adds a key-based distribution requirement that can be satisfied by hash or 
range partitioning
   - checks that both join sides can be paired by partition index
   - keeps incompatible range bounds, mismatched hash counts, and non-inner 
joins on the existing hash repartition path
   - keeps partitioned dynamic filters restricted to hash-compatible routing
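
The checks listed above can be sketched roughly as follows. This is a minimal, self-contained illustration and not DataFusion's implementation: `SketchPartitioning`, `SketchJoinType`, `can_pair_by_index`, and `allows_partitioned_dynamic_filter` are hypothetical names, range bounds are simplified to `i64` split points on a single ordering, and key compatibility is reduced to an exact, in-order match against the join keys.

```rust
/// Hypothetical, simplified stand-in for a physical partitioning scheme.
/// This is NOT DataFusion's `Partitioning` type.
#[allow(dead_code)]
#[derive(Debug, Clone, PartialEq)]
enum SketchPartitioning {
    /// Rows are routed by a hash of `keys` into `partitions` buckets.
    Hash { keys: Vec<String>, partitions: usize },
    /// Rows are routed by comparing `keys` against sorted split points;
    /// N split points define N + 1 partitions.
    Range { keys: Vec<String>, split_points: Vec<i64> },
}

/// Hypothetical join type, reduced to what the sketch needs.
#[allow(dead_code)]
#[derive(Debug, Clone, Copy, PartialEq)]
enum SketchJoinType {
    Inner,
    Left,
}

/// True when the partitioning keys are exactly the join keys, in order.
fn keys_match(partition_keys: &[String], join_keys: &[&str]) -> bool {
    partition_keys
        .iter()
        .map(|k| k.as_str())
        .eq(join_keys.iter().copied())
}

/// Returns true when partition `i` on the left only needs to be joined with
/// partition `i` on the right, so no hash repartition is required.
/// Anything not proven compatible falls back to `false` (repartition).
fn can_pair_by_index(
    join_type: SketchJoinType,
    left: &SketchPartitioning,
    left_join_keys: &[&str],
    right: &SketchPartitioning,
    right_join_keys: &[&str],
) -> bool {
    // Assumption taken from the issue text: only inner joins are in scope.
    if join_type != SketchJoinType::Inner {
        return false;
    }
    match (left, right) {
        // Hash on both sides: keys must be the join keys, counts must match.
        (
            SketchPartitioning::Hash { keys: lk, partitions: lp },
            SketchPartitioning::Hash { keys: rk, partitions: rp },
        ) => keys_match(lk, left_join_keys) && keys_match(rk, right_join_keys) && lp == rp,
        // Range on both sides: keys must be the join keys, bounds must be identical.
        (
            SketchPartitioning::Range { keys: lk, split_points: lb },
            SketchPartitioning::Range { keys: rk, split_points: rb },
        ) => keys_match(lk, left_join_keys) && keys_match(rk, right_join_keys) && lb == rb,
        // Mixed schemes are treated as incompatible in this sketch.
        _ => false,
    }
}

/// Partitioned dynamic filters stay limited to hash routing in this sketch.
fn allows_partitioned_dynamic_filter(
    left: &SketchPartitioning,
    right: &SketchPartitioning,
) -> bool {
    matches!(
        (left, right),
        (SketchPartitioning::Hash { .. }, SketchPartitioning::Hash { .. })
    )
}
```

Returning `false` for every case that is not explicitly proven compatible mirrors the conservative fallback described in the list: the planner would keep inserting hash repartitions unless both sides can be shown to line up by partition index.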
   
   ### Additional context
   
   This is a focused step in the range partitioning epic (#22395): preserving 
range partitioning through partitioned joins when the two inputs' partitioning 
is proven compatible.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

