[ 
https://issues.apache.org/jira/browse/CALCITE-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15072536#comment-15072536
 ] 

Julian Hyde commented on CALCITE-1037:
--------------------------------------

Yes, I suppose you have to treat Correlate the same way as a cartesian join. 
(Which is the same as a join where the left and right keys have 0 columns.)

Your work is definitely an improvement over what we have currently. If you add 
one or two tests to RelMetadataTest and create a pull request I'd be happy to 
accept it.

> Column uniqueness is calculated incorrectly for 'Correlate' expression
> ----------------------------------------------------------------------
>
>                 Key: CALCITE-1037
>                 URL: https://issues.apache.org/jira/browse/CALCITE-1037
>             Project: Calcite
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 1.5.0
>            Reporter: Alexey Makhmutov
>            Assignee: Julian Hyde
>
> Column uniqueness is calculated incorrectly for 'Correlate' expression -- and 
> in some cases this leads to java.lang.IndexOutOfBoundsException. Example of 
> such code:
> {code}select
>  x.v
> from
>  (
>   select
>    t1.v
>   from
>    (values (1,1),(1,2)) as t1(k,v) 
>    join (values (1)) as t2(k) on t1.k=t2.k
>  ) x,
>  lateral
>  (
>   select 
>    t.v
>   from
>    unnest(multiset[x.v]) as t(v)
>  ) y
> group by x.v,y.v{code}
> The problem seems to be related to the 
> org.apache.calcite.rel.metadata.RelMdColumnUniqueness.areColumnsUnique(Correlate 
> rel, ImmutableBitSet columns, boolean ignoreNulls) method -- it just 
> delegates the uniqueness check to the left input without changing the column 
> list, which leads to an exception if this list references columns from the 
> right input.
> It seems that the correct behavior should be as follows:
> * For the Anti/Semi join types, keep the current behavior (as the resulting 
> rows contain fields only from the left input).
> * For the Left/Inner join types, a column set for the correlate is unique only 
> if it includes unique sets from both sides.
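
The two rules proposed in the issue description can be sketched roughly as 
follows. This is only an illustration and not the actual Calcite code or the 
eventual patch: it uses java.util.BitSet in place of ImmutableBitSet, and the 
names CorrelateUniquenessSketch, JoinKind, Input, keyed and bits are all 
hypothetical stand-ins invented for this example. The stand-in input throws 
IndexOutOfBoundsException when asked about a column it does not have, which 
mimics the failure reported above.

```java
import java.util.BitSet;

// Hypothetical, simplified model; none of these types are Calcite classes.
class CorrelateUniquenessSketch {
    /** Stand-in for the join type carried by a Correlate node. */
    enum JoinKind { INNER, LEFT, SEMI, ANTI }

    /** Stand-in for an input relation: a field count plus a uniqueness check. */
    interface Input {
        int fieldCount();
        boolean areColumnsUnique(BitSet columns);
    }

    /** Builds a BitSet from column ordinals. */
    static BitSet bits(int... indexes) {
        BitSet set = new BitSet();
        for (int i : indexes) {
            set.set(i);
        }
        return set;
    }

    /** An input whose unique keys are known; rejects out-of-range ordinals. */
    static Input keyed(final int fieldCount, final BitSet... keys) {
        return new Input() {
            public int fieldCount() { return fieldCount; }
            public boolean areColumnsUnique(BitSet columns) {
                if (columns.length() > fieldCount) {
                    throw new IndexOutOfBoundsException("no such column in " + columns);
                }
                for (BitSet key : keys) {
                    BitSet missing = (BitSet) key.clone();
                    missing.andNot(columns);      // key columns not covered by 'columns'
                    if (missing.isEmpty()) {
                        return true;              // 'columns' contains a whole key
                    }
                }
                return false;
            }
        };
    }

    /** Proposed rule: split the column set by side before asking the inputs. */
    static boolean areColumnsUnique(Input left, Input right, JoinKind kind, BitSet columns) {
        if (kind == JoinKind.SEMI || kind == JoinKind.ANTI) {
            // Output rows contain only left fields, so delegating is safe.
            return left.areColumnsUnique(columns);
        }
        int leftCount = left.fieldCount();
        BitSet leftColumns = new BitSet();
        BitSet rightColumns = new BitSet();
        for (int i = columns.nextSetBit(0); i >= 0; i = columns.nextSetBit(i + 1)) {
            if (i < leftCount) {
                leftColumns.set(i);
            } else {
                rightColumns.set(i - leftCount);  // shift into the right input's ordinals
            }
        }
        // Unique only if the requested set includes a unique set from both sides.
        return left.areColumnsUnique(leftColumns) && right.areColumnsUnique(rightColumns);
    }

    public static void main(String[] args) {
        Input x = keyed(1, bits(0));  // assumed: x.v is a key of the left input
        Input y = keyed(1, bits(0));  // assumed: y.v is a key of the right input
        System.out.println(areColumnsUnique(x, y, JoinKind.INNER, bits(0, 1)));
        System.out.println(areColumnsUnique(x, y, JoinKind.INNER, bits(0)));
    }
}
```

With one field on each side, as in the query above, the set {0, 1} is split 
into {0} for the left input and {0} for the right input instead of being 
passed unchanged to the left input. Treating the inner and left cases like a 
join with no key columns is in line with the comment above about cartesian 
joins.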



