[ https://issues.apache.org/jira/browse/TAJO-972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071281#comment-14071281 ]
ASF GitHub Bot commented on TAJO-972:
-------------------------------------
Github user blrunner commented on the pull request:
https://github.com/apache/tajo/pull/89#issuecomment-49826247
+1
Thanks for your contribution; it looks good overall.
I'll push it now.
> Broadcast join with left outer join returns duplicated rows.
> ------------------------------------------------------------
>
> Key: TAJO-972
> URL: https://issues.apache.org/jira/browse/TAJO-972
> Project: Tajo
> Issue Type: Bug
> Reporter: Hyoungjun Kim
> Assignee: Hyoungjun Kim
> Priority: Minor
>
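> If a LEFT OUTER JOIN uses a broadcast table and the broadcast table is on
> the left side, every task runs the join operation against all rows of the
> broadcast table. A given left-side row therefore matches in some tasks and
> fails to match in others, and each task that finds no match emits its own
> null-padded row, so the combined result contains duplicated rows.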
> For example:
> {noformat}
> default>select * from small
> id
> -----------------
> 1
> 2
> 3
> default>select * from large
> id
> -----------------
> 1
> 4 <-- Block1 in HDFS
> 5
> ...
> 2 <-- Block2 in HDFS
> 6
> default> select a.id, b.id from small a left outer join large b on a.id = b.id
> a.id b.id
> ---------------------------
> 1 1
> 2 null
> 3 null
> 1 null
> 2 2
> 3 null
> {noformat}
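
The behavior reported above can be reproduced with a small simulation. This is only a sketch and not Tajo code: the `left_outer_join` helper, the list-based tables, and the two-block split of `large` (with the elided rows left out) are assumptions made for illustration. Each simulated task joins the full broadcast copy of `small` against a single block of `large`, and the per-task outputs are then concatenated.

```python
# Hypothetical simulation of the reported behavior; this is not Tajo code.

def left_outer_join(left_rows, right_rows):
    """Naive left outer join on id equality; unmatched left rows pair with None."""
    result = []
    for left in left_rows:
        matches = [right for right in right_rows if right == left]
        if matches:
            result.extend((left, right) for right in matches)
        else:
            result.append((left, None))
    return result

small = [1, 2, 3]                    # broadcast table on the left side
large_blocks = [[1, 4, 5], [2, 6]]   # assumed split of `large` into two blocks

# Reported behavior: every task joins all of `small` against only its own block,
# so a left row that matches in another block is still emitted with None here.
per_task_result = []
for block in large_blocks:
    per_task_result.extend(left_outer_join(small, block))

# Expected behavior: one left outer join against the whole `large` table.
all_large_rows = [row for block in large_blocks for row in block]
expected_result = left_outer_join(small, all_large_rows)

print(per_task_result)
print(expected_result)
```

In this sketch the per-task result has six rows, matching the output in the report, while the single join over the whole table yields three rows.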
--
This message was sent by Atlassian JIRA
(v6.2#6252)