[ 
https://issues.apache.org/jira/browse/TAJO-972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071297#comment-14071297
 ] 

Hudson commented on TAJO-972:
-----------------------------

SUCCESS: Integrated in Tajo-master-build #311 (See 
[https://builds.apache.org/job/Tajo-master-build/311/])
TAJO-972: Broadcast join with left outer join returns duplicated 
rows.(Hyoungjun Kim via jaehwa) (blrunner: rev 
a5de837209a8d6d9685ad1aa8132b3b4ecd99727)
* tajo-core/src/main/java/org/apache/tajo/engine/planner/global/GlobalPlanner.java
* CHANGES
* tajo-core/src/test/java/org/apache/tajo/engine/planner/global/TestBroadcastJoinPlan.java
* tajo-core/src/test/java/org/apache/tajo/engine/query/TestJoinBroadcast.java


> Broadcast join with left outer join returns duplicated rows.
> ------------------------------------------------------------
>
>                 Key: TAJO-972
>                 URL: https://issues.apache.org/jira/browse/TAJO-972
>             Project: Tajo
>          Issue Type: Bug
>            Reporter: Hyoungjun Kim
>            Assignee: Hyoungjun Kim
>            Priority: Minor
>
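> If a LEFT OUTER JOIN has a broadcast table and the broadcast target table is
> on the left side, every task runs the join operation against all rows of the
> broadcast table. A given left-side row therefore matches in some tasks but
> not in others, and each non-matching task emits an extra null-padded row.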
> For example:
> {noformat}
> default>select * from small
> id
> -----------------
> 1
> 2
> 3
> default>select * from large
> 1
> 4    <-- Block1 in HDFS
> 5
> ...
> 2    <-- Block2 in HDFS
> 6
> default> select a.id, b.id from small a left outer join large b on a.id = b.id
> a.id    b.id
> ---------------------------
> 1  1
> 2  null
> 3  null
> 1  null
> 2  2
> 3  null
> {noformat}
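
The duplicated rows in the example above can be reproduced with a small simulation. This is only an illustrative sketch, not Tajo code: the function `left_outer_join` and the variable names are hypothetical, and the block contents are assumed from the example (the rows elided by "..." are left out). Each "task" joins the full broadcast table `small`, as the preserved left side, against a single block of `large`.

```python
def left_outer_join(left_rows, right_rows):
    """Naive left outer join on equal id values (hypothetical helper)."""
    out = []
    for left in left_rows:
        matches = [right for right in right_rows if right == left]
        if matches:
            out.extend((left, right) for right in matches)
        else:
            # The preserved side has no match: pad with None (SQL null).
            out.append((left, None))
    return out


small = [1, 2, 3]
# Assumption: one list per HDFS block, and therefore one per task.
large_blocks = [[1, 4, 5], [2, 6]]

# Behaviour described in the issue: every task outer-joins the whole
# broadcast table against only its own block, and the outputs are unioned.
per_task_result = [
    row for block in large_blocks for row in left_outer_join(small, block)
]

# Reference result: one outer join against the complete right-hand table.
all_large = [row for block in large_blocks for row in block]
reference_result = left_outer_join(small, all_large)

print(per_task_result)
print(reference_result)
```

In this sketch the per-task union contains six rows, in the same order as the output reported in the issue, while the reference join over the whole of `large` contains three.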



--
This message was sent by Atlassian JIRA
(v6.2#6252)
