[ 
https://issues.apache.org/jira/browse/HIVE-19095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16423837#comment-16423837
 ] 

Zoltan Haindrich commented on HIVE-19095:
-----------------------------------------

I think stat task execution is even worse than this 2 minutes...because 
metastore updates the stat data 1 by 1  - altought I'm not sure if that could 
be improved, but it definetly worth a look...

> Improve analyze statement execution time for partitioned tables
> ---------------------------------------------------------------
>
>                 Key: HIVE-19095
>                 URL: https://issues.apache.org/jira/browse/HIVE-19095
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Zoltan Haindrich
>            Priority: Major
>
> tpcds@1TB:
> {{analyze web_returns compile statistics for columns}} 
> both compile and stat task execution is slow; 
> there were ~2000 calls to get_partitions_ps_with_auth which took 2minutes
> stattask seems to be slow because it seems like the metastore updates the 
> stats 1 by 1 for each partition
> {flushCache=1, optimizer=565, open_txns=8, TezCompiler=5248, 
> get_table_req=69, get_partitions_ps_with_auth=130333}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to