[ 
https://issues.apache.org/jira/browse/HIVE-8651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14190509#comment-14190509
 ] 

Ashutosh Chauhan commented on HIVE-8651:
----------------------------------------

Thinking more about this, I don't think this is an issue. CBO reordered the join 
and Hive still correctly inferred the bucketing / sorting properties. Since these 
properties are recorded per partition, and SMBJ & BMJ pay attention to 
per-partition properties, bucketing of tables and subsequent joins should still 
work fine. More info on HIVE-933.

cc: [~vikram.dixit] If different partitions are bucketed / sorted on different 
keys, what happens to SMBJ / BMJ? As per my understanding, those optimized joins 
check all partitions and kick in only if all partitions are bucketed 
/ sorted on the same key. Can you confirm? 
If that's true, then I think this is a non-issue. Although, we should update the 
comment on that test (which tripped me up) to reflect the new reality.
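
To make the question concrete, here is a minimal sketch of the all-partitions 
check as I understand it. This is not Hive's actual code: PartitionLayout and 
smb_join_eligible are hypothetical names used only for illustration, and the 
behavior itself is still pending confirmation.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class PartitionLayout:
    """Hypothetical per-partition metadata (not a Hive class)."""
    bucket_cols: Tuple[str, ...]
    sort_cols: Tuple[str, ...]


def smb_join_eligible(partitions: Sequence[PartitionLayout],
                      join_keys: Sequence[str]) -> bool:
    """Return True only if every partition is bucketed and sorted on join_keys.

    Assumption: a table with no partitions is treated as not eligible,
    which is the conservative choice for this sketch.
    """
    keys = tuple(join_keys)
    if not partitions:
        return False
    return all(p.bucket_cols == keys and p.sort_cols == keys
               for p in partitions)


# One partition written before the join reorder, one after.
by_value = PartitionLayout(("value",), ("value",))
by_key = PartitionLayout(("key",), ("key",))

print(smb_join_eligible([by_key, by_key], ["key"]))
print(smb_join_eligible([by_value, by_key], ["key"]))
```

Under this assumption, a table whose partitions disagree on the bucket / sort 
key simply falls back to a regular join instead of producing wrong results.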

> CBO: sort column changed in infer_bucket_sort test
> --------------------------------------------------
>
>                 Key: HIVE-8651
>                 URL: https://issues.apache.org/jira/browse/HIVE-8651
>             Project: Hive
>          Issue Type: Sub-task
>          Components: CBO
>            Reporter: Sergey Shelukhin
>            Assignee: Jesús Camacho Rodríguez
>             Fix For: 0.15.0
>
>
> {noformat}
> -Bucket Columns:        [value]                  
> -Sort Columns:          [Order(col:value, order:1)]      
> +Bucket Columns:        [key]                    
> +Sort Columns:          [Order(col:key, order:1)]        
> {noformat}
> That is in the case for "Test join on three tables on different keys, should 
> be bucketed and sorted by latter key"; the sort changed to the first key 
> instead



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
