[ 
https://issues.apache.org/jira/browse/HIVE-11160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15182396#comment-15182396
 ] 

Pengcheng Xiong commented on HIVE-11160:
----------------------------------------

(1) Sure.
(2) Yes, that is true. However, even in the static partition case, where we 
gather column stats for only a single static partition, we still need a GROUP 
BY (GBY) operator; see the EXPLAIN output for "insert overwrite table alter5 
partition (dt='a') select key from src" in the new patch. In the dynamic 
partition case (or a mix of static and dynamic partitions), we group by the 
partition key and gather stats for all the partitions, because we do not know 
in advance which partitions the new data will go to. 
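
To make the grouping concrete, here is a minimal sketch in Python rather than Hive's actual operator plan. The function name gather_column_stats, the row layout, and the chosen statistics (count, null count, min, max) are assumptions for illustration only. It shows why a single aggregation keyed by the partition columns covers both cases: with no partition columns (static case) all rows fall into one group, and with dt as the key (dynamic case) each partition that receives data gets its own stats.

```python
from collections import defaultdict


def gather_column_stats(rows, partition_cols, column):
    """Group rows by partition key and compute simple stats per group.

    Hypothetical helper for illustration; not part of Hive.
    """
    groups = defaultdict(list)
    for row in rows:
        # The group key is the tuple of partition values; it is the empty
        # tuple when every row goes to one known (static) partition.
        key = tuple(row[c] for c in partition_cols)
        groups[key].append(row[column])

    stats = {}
    for key, values in groups.items():
        non_null = [v for v in values if v is not None]
        stats[key] = {
            "count": len(values),
            "num_nulls": len(values) - len(non_null),
            "min": min(non_null) if non_null else None,
            "max": max(non_null) if non_null else None,
        }
    return stats


rows = [
    {"key": 1, "dt": "a"},
    {"key": 5, "dt": "a"},
    {"key": None, "dt": "b"},
    {"key": 3, "dt": "b"},
]

# Dynamic partitions: the target partition is only known per row, so group by dt.
dynamic_stats = gather_column_stats(rows, ("dt",), "key")

# Static partition dt='a': one group, but the aggregation step is still needed.
static_rows = [r for r in rows if r["dt"] == "a"]
static_stats = gather_column_stats(static_rows, (), "key")
```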

> Auto-gather column stats
> ------------------------
>
>                 Key: HIVE-11160
>                 URL: https://issues.apache.org/jira/browse/HIVE-11160
>             Project: Hive
>          Issue Type: New Feature
>            Reporter: Pengcheng Xiong
>            Assignee: Pengcheng Xiong
>         Attachments: HIVE-11160.01.patch, HIVE-11160.02.patch, 
> HIVE-11160.03.patch, HIVE-11160.04.patch, HIVE-11160.05.patch
>
>
> When hive.stats.autogather=true, Hive collects table stats automatically 
> during the INSERT OVERWRITE command. Users then have to collect the column 
> stats themselves using the "Analyze" command. With this patch, the column 
> stats are also collected automatically. More specifically, INSERT OVERWRITE 
> automatically creates new column stats, and INSERT INTO automatically merges 
> new column stats with the existing ones.
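
The difference between the two commands described above can be sketched as follows. This is an illustration in Python under stated assumptions, not the patch's code: the names merge_column_stats and apply_insert and the dictionary layout of the stats are hypothetical. INSERT OVERWRITE replaces whatever stats existed, while INSERT INTO combines the stats of the new rows with the stored ones.

```python
def merge_column_stats(existing, new):
    """Combine two stats records for the same column (hypothetical layout)."""

    def pick(fn, a, b):
        # Ignore missing bounds, e.g. when one side had only NULL values.
        present = [v for v in (a, b) if v is not None]
        return fn(present) if present else None

    return {
        "count": existing["count"] + new["count"],
        "num_nulls": existing["num_nulls"] + new["num_nulls"],
        "min": pick(min, existing["min"], new["min"]),
        "max": pick(max, existing["max"], new["max"]),
    }


def apply_insert(existing, new, overwrite):
    """Return the stats to store after an insert.

    overwrite=True models INSERT OVERWRITE (fresh stats);
    overwrite=False models INSERT INTO (merge with existing stats).
    """
    if overwrite or existing is None:
        return dict(new)
    return merge_column_stats(existing, new)


old = {"count": 10, "num_nulls": 1, "min": 2, "max": 40}
new = {"count": 4, "num_nulls": 0, "min": 1, "max": 25}

after_overwrite = apply_insert(old, new, overwrite=True)
after_insert_into = apply_insert(old, new, overwrite=False)
```

Counts, null counts, and min/max merge exactly in this way. A distinct-value count does not, because the two sets of values may overlap, so merging it needs an estimate or a mergeable summary; the sketch leaves it out for that reason.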



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
