[jira] [Updated] (HIVE-23721) MetaStoreDirectSql.ensureDbInit() need to optimize QuerySQL

ASF GitHub Bot (Jira) Thu, 02 Jul 2020 00:48:17 -0700


     [ 
https://issues.apache.org/jira/browse/HIVE-23721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


ASF GitHub Bot updated HIVE-23721:
----------------------------------
    Labels: pull-request-available  (was: )

> MetaStoreDirectSql.ensureDbInit() need to optimize QuerySQL
> -----------------------------------------------------------
>
>                 Key: HIVE-23721
>                 URL: https://issues.apache.org/jira/browse/HIVE-23721
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 3.1.2
>         Environment: Hadoop 3.1（1700+ nodes）
> YARN 3.1 （with timelineserver enabled，https enabled)
> Hive 3.1 (15 HS2 instance)
> 60000+ YARN Applications every day
>            Reporter: YulongZ
>            Assignee: zhangbutao
>            Priority: Critical
>              Labels: pull-request-available
>             Fix For: 4.0.0
>
>         Attachments: HIVE-23721.01.patch
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> From Hive3.0，catalog added to hivemeta，many schema of metastore added column 
> “catName”，and index for table added column “catName”。
> In MetaStoreDirectSql.ensureDbInit() ，two queries below
> “
>       initQueries.add(pm.newQuery(MTableColumnStatistics.class, "dbName == 
> ''"));
>       initQueries.add(pm.newQuery(MPartitionColumnStatistics.class, "dbName 
> == ''"));
> ”
> should use "catName == ''" instead of "dbName == ''"，because “catName” is the 
> first index column。
> When  data of metastore become large，for example， table of 
> MPartitionColumnStatistics have millions of lines。The 
> “newQuery(MPartitionColumnStatistics.class, "dbName == ''")” for metastore 
> executed very slowly，and the query “show tables“ for hiveserver2 executed 
> very slowly too。



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Updated] (HIVE-23721) MetaStoreDirectSql.ensureDbInit() need to optimize QuerySQL

Reply via email to