Hyoungjun Kim created TAJO-972:
----------------------------------

             Summary: Broadcast join with left outer join returns duplicated 
rows.
                 Key: TAJO-972
                 URL: https://issues.apache.org/jira/browse/TAJO-972
             Project: Tajo
          Issue Type: Bug
            Reporter: Hyoungjun Kim
            Assignee: Hyoungjun Kim
            Priority: Minor


If LEFT OUTER JOIN has broadcast table and broadcast target table is left side, 
every tasks run join operation with all rows in broadcast table. So some tasks 
match and other tasks doesn't match. 
For example:
{noformat}
default>select * from small
id
-----------------
1
2
3

default>select * from large
1
4    <-- Block1 in HDFS
5
...
2    <-- Block2 in HDFS
6

default> select a.id, b.id from small a left outer join large b on a.id = b.id
a.id    b.id
---------------------------
1  1
2  null
3  null
1  null
2  2
3  null
{noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to