[
https://issues.apache.org/jira/browse/HIVE-21485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16798901#comment-16798901
]
Qingxin Wu commented on HIVE-21485:
-----------------------------------
Hi [~starphin], The reason why we keep so many partitions is due to important
historical data. And I think we should deal with it properly. The patch was
uploaded. Could you please take a look at it?
> Hive desc operation takes more than 100 seconds after upgrading from Hive
> 1.2.1 to 2.3.4
> ----------------------------------------------------------------------------------------
>
> Key: HIVE-21485
> URL: https://issues.apache.org/jira/browse/HIVE-21485
> Project: Hive
> Issue Type: Bug
> Components: CLI, Hive
> Affects Versions: 2.3.4
> Reporter: Qingxin Wu
> Assignee: Qingxin Wu
> Priority: Major
> Labels: pull-request-available
> Attachments: HIVE-21485.patch
>
> Time Spent: 40m
> Remaining Estimate: 0h
>
> Hive desc [formatted|extended] operation cost more than 100 seconds after
> upgrading from Hive 1.2.1 to 2.3.4. This is mainly caused by showing stats
> for partitioned tables which was introduced by HIVE-16098 when the
> partitioned tables have a large amount of partitions. In our case, the number
> of partition is 187221.
> {code:java}
> hive> desc bus.kafka_data;
> OK
> id string
> ...
> d map<string,string>
> stat_date string
> log_id string
> # Partition Information
> # col_name data_type comment
> stat_date string
> log_id string
> Time taken: 115.342 seconds, Fetched: 42 row(s)
> {code}
> same operation executed in hive-1.2.1 and only cost 2 seconds.
> {code:java}
> hive> desc bus.kafka_data;
> OK
> id string
> ...
> d map<string,string>
> stat_date string
> log_id string
> # Partition Information
> # col_name data_type comment
> stat_date string
> log_id string
> Time taken: 2.037 seconds, Fetched: 42 row(s)
> {code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)