aokolnychyi edited a comment on pull request #31083:
URL: https://github.com/apache/spark/pull/31083#issuecomment-758723800


   @cloud-fan, let's try to make the required distribution and ordering work. I 
have one case that I am not sure how to address using this approach: explicit 
sorts/repartitions that have no effect. We can probably cover cases where adding 
a sort/repartition would be redundant (as we can check the child distribution 
and ordering), but I am not sure we can remove existing sorts/repartitions that 
have no effect if we don't go through the optimizer.
   
   For example, suppose someone does a manual global sort by (c1, c2) but the 
table requires a global sort by (c3, c4). Even though the manual sort will have 
no effect on the final ordering, we will still perform it, so we will end up 
with 2 global sorts. Under the current approach, this case is handled and the 
manual sort is removed (thanks to `EliminateSorts` in the optimizer).
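
   To make the case concrete, here is a toy model of the plan shape in plain 
Python. It is only a sketch: `Scan`, `Sort`, `strip_sorts`, 
`eliminate_redundant_sorts`, and `count_sorts` are hypothetical names made up 
for illustration, not Spark classes, and the rule is a simplified stand-in for 
what `EliminateSorts` does, not its actual implementation.

```python
from dataclasses import dataclass
from typing import Tuple, Union


@dataclass(frozen=True)
class Scan:
    """Leaf node: reads rows from a (hypothetical) source."""
    table: str


@dataclass(frozen=True)
class Sort:
    """Global sort of the child's output by the given columns."""
    order: Tuple[str, ...]
    child: "Plan"


Plan = Union[Scan, Sort]


def strip_sorts(plan: Plan) -> Plan:
    """Drop every Sort at the top of `plan`; a parent sort would override them."""
    while isinstance(plan, Sort):
        plan = plan.child
    return plan


def eliminate_redundant_sorts(plan: Plan) -> Plan:
    """Keep only the outermost Sort, since the sorts beneath it have no effect."""
    if isinstance(plan, Sort):
        return Sort(plan.order, strip_sorts(plan.child))
    return plan


def count_sorts(plan: Plan) -> int:
    """Count the Sort nodes stacked on top of the leaf."""
    n = 0
    while isinstance(plan, Sort):
        n += 1
        plan = plan.child
    return n


# Manual global sort by (c1, c2), written by the user.
manual = Sort(("c1", "c2"), Scan("source"))
# Global sort by (c3, c4), required by the table and added on top.
required = Sort(("c3", "c4"), manual)

# Going through an optimizer-style rule leaves a single sort.
optimized = eliminate_redundant_sorts(required)
```

   In this model, `required` contains both sorts, while `optimized` keeps only 
the sort by (c3, c4). If the required sort is injected after the optimizer has 
run, nothing plays the role of `eliminate_redundant_sorts`, and both sorts are 
executed.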




