WangGuangxin opened a new pull request #26011: [SPARK-29343]Eliminate sorts 
without limit in the subquery of Join/Aggregation
URL: https://github.com/apache/spark/pull/26011
 
 
   <!--
   Thanks for sending a pull request!  Here are some tips for you:
     1. If this is your first time, please read our contributor guidelines: 
https://spark.apache.org/contributing.html
     2. Ensure you have added or run the appropriate tests for your PR: 
https://spark.apache.org/developer-tools.html
     3. If the PR is unfinished, add '[WIP]' in your PR title, e.g., 
'[WIP][SPARK-XXXX] Your PR title ...'.
     4. Be sure to keep the PR description updated to reflect all changes.
     5. Please write your PR title to summarize what this PR proposes.
     6. If possible, provide a concise example to reproduce the issue for a 
faster review.
   -->
   
   ### What changes were proposed in this pull request?
   The `Sort` without `Limit` operator in `Join/GroupBy` subquery is useless. 
   This PR try to remove this kind of `Sort` operator in `SQL Optimizer`.
   
   ### Why are the changes needed?
   For example,  `select count(1) from (select a from test1 order by a)` is 
equal to `select count(1) from (select a from test1)`. 
   'select * from (select a from test1 order by a) t1 join (select b from 
test2) t2 on t1.a = t2.b' is equal to `select * from (select a from test1) t1 
join (select b from test2) t2 on t1.a = t2.b`.
   
   Remove useless `Sort` operator can import performance. 
   
   
   ### Does this PR introduce any user-facing change?
   No
   
   ### How was this patch tested?
   Adding new UT `RemoveSortInSubquerySuite.scala`
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to