[GitHub] spark issue #14411: [SPARK-16804][SQL] Correlated subqueries containing LIMI...

nsyca Mon, 01 Aug 2016 06:41:21 -0700

Github user nsyca commented on the issue:

    https://github.com/apache/spark/pull/14411
  
    @hvanhovell,
    
    Thank you for your comment. There are quite a few patterns being 
blacklisted already, such as correlation under set operators (UNION, EXCEPT, 
INTERSECT), correlation outside of WHERE/HAVING context, correlation in the 
right table of a LEFT [OUTER] JOIN (and the left table of a RIGHT [OUTER] 
JOIN]). I am working on discovering more issues in this area but it looks like 
a bigger project to me. I have a general idea that the rewrite of correlation 
subquery to join should not happen in the Analysis phase. We should build a 
Logical plan to represent the subquery and perform the rewrite at the Optimizer 
phase instead.
    
    I am new to the Spark code and this is my first PR. So I'd like to make it 
a small, self-contained project to gain my confidence in working with the code.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark issue #14411: [SPARK-16804][SQL] Correlated subqueries containing LIMI...

Reply via email to