gene-bordegaray opened a new issue, #23183: URL: https://github.com/apache/datafusion/issues/23183
### Is your feature request related to a problem or challenge? Part of #22395. DataFusion can represent source-declared `Partitioning::Range`, but inner partitioned hash joins still require hash-partitioned inputs. This can add unnecessary hash repartitions even when both join inputs are already partitioned by compatible ranges on the join keys. ### Describe the solution you'd like Allow inner partitioned hash joins to accept inputs that are co-partitioned on their join keys, including compatible range-partitioned inputs. This issue tracks a focused PR that: - adds a key-based distribution requirement that can be satisfied by hash or range partitioning - checks that both join sides can be paired by partition index - keeps incompatible range bounds, mismatched hash counts, and non-inner joins on the existing hash repartition path - keeps partitioned dynamic filters restricted to hash-compatible routing ### Additional context This is a focused step in the range partitioning epic (#22395), specifically around preserving range partitioning through partitioned joins when doing so is proven compatible. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
