peter-toth opened a new pull request, #53142:
URL: https://github.com/apache/spark/pull/53142

   ### What changes were proposed in this pull request?
   
   Fix `KeyGroupedShuffleSpec.createPartitioning()` as clustering required at 
the other side of the join might contain more clustering expressions than the 
number of expressions in the shuffle spec's `KeyGroupedPartitioning`, so simply 
zipping them is not correct.
   
   ### Why are the changes needed?
   
   Fix a correctness issue due to wrong partitioning on the shuffle side.
   
   ### Does this PR introduce _any_ user-facing change?
   
   Yes, it fixes the query.
   
   ### How was this patch tested?
   
   Added new UT.
   
   ### Was this patch authored or co-authored using generative AI tooling?
   
   No.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to