Dandandan opened a new issue, #23112: URL: https://github.com/apache/datafusion/issues/23112
### Is your feature request related to a problem or challenge? SortMergeJoinExec currently always runs in a symmetric, hash-partitioned mode: both inputs are hash-partitioned on the join keys and sorted, and partition i of the left is merge-joined with partition i of the right. When one side is small, hash-repartitioning the large side is wasteful. For hash joins this is already handled by PartitionMode::CollectLeft (collect the small build side once, broadcast it, don't repartition the probe side). This PR brings the same idea to sort-merge joins. ### Describe the solution you'd like Implement CollectLeft mode for SMJ ### Describe alternatives you've considered _No response_ ### Additional context _No response_ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
