[ 
https://issues.apache.org/jira/browse/HIVE-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840802#action_12840802
 ] 

Namit Jain commented on HIVE-1194:
----------------------------------

smb_mapjoin4.q:


POSTHOOK: query: select /*+mapjoin(a,b)*/ * from smb_bucket_1 a left outer join 
smb_bucket_2 b on a.key \
= b.key left outer join smb_bucket_3 c on b.key=c.key
POSTHOOK: type: QUERY
POSTHOOK: Input: defa...@smb_bucket_2
POSTHOOK: Input: defa...@smb_bucket_3
POSTHOOK: Input: defa...@smb_bucket_1
POSTHOOK: Output: 
file:/Users/heyongqiang/Documents/workspace/Hive-Test/build/ql/scratchdir/hive_2010-03\
-02_16-28-42_346_3202067314016412424/10000
1 val_1 NULL  NULL  NULL  NULL
3 val_3 NULL  NULL  NULL  NULL
4 val_4 NULL  NULL  NULL  NULL
5 val_5 NULL  NULL  NULL  NULL
10  val_10  NULL  NULL  NULL  NULL


I am not sure if the above semantics are correct - this may be a existing bug 
in the code, can you check the semantics of mysql and oracle ?

> sorted merge join
> -----------------
>
>                 Key: HIVE-1194
>                 URL: https://issues.apache.org/jira/browse/HIVE-1194
>             Project: Hadoop Hive
>          Issue Type: New Feature
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: He Yongqiang
>             Fix For: 0.6.0
>
>         Attachments: hive-1194-2010-02-28.patch, hive-1194-2010-3-2.2.patch, 
> hive-1194-2010-3-2.patch
>
>
> If the input tables are sorted on the join key, and a mapjoin is being 
> performed, it is useful to exploit the sorted properties of the table.
> This can lead to substantial cpu savings - this needs to work across bucketed 
> map joins also.
> Since, sorted properties of a table are not enforced currently, a new 
> parameter can be added to specify to use the sort-merge join.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to