[
https://issues.apache.org/jira/browse/TAJO-972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071297#comment-14071297
]
Hudson commented on TAJO-972:
-----------------------------
SUCCESS: Integrated in Tajo-master-build #311 (See
[https://builds.apache.org/job/Tajo-master-build/311/])
TAJO-972: Broadcast join with left outer join returns duplicated
rows.(Hyoungjun Kim via jaehwa) (blrunner: rev
a5de837209a8d6d9685ad1aa8132b3b4ecd99727)
*
tajo-core/src/main/java/org/apache/tajo/engine/planner/global/GlobalPlanner.java
* CHANGES
*
tajo-core/src/test/java/org/apache/tajo/engine/planner/global/TestBroadcastJoinPlan.java
* tajo-core/src/test/java/org/apache/tajo/engine/query/TestJoinBroadcast.java
> Broadcast join with left outer join returns duplicated rows.
> ------------------------------------------------------------
>
> Key: TAJO-972
> URL: https://issues.apache.org/jira/browse/TAJO-972
> Project: Tajo
> Issue Type: Bug
> Reporter: Hyoungjun Kim
> Assignee: Hyoungjun Kim
> Priority: Minor
>
> If LEFT OUTER JOIN has broadcast table and broadcast target table is left
> side, every tasks run join operation with all rows in broadcast table. So
> some tasks match and other tasks doesn't match.
> For example:
> {noformat}
> default>select * from small
> id
> -----------------
> 1
> 2
> 3
> default>select * from large
> 1
> 4 <-- Block1 in HDFS
> 5
> ...
> 2 <-- Block2 in HDFS
> 6
> default> select a.id, b.id from small a left outer join large b on a.id = b.id
> a.id b.id
> ---------------------------
> 1 1
> 2 null
> 3 null
> 1 null
> 2 2
> 3 null
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.2#6252)