[
https://issues.apache.org/jira/browse/HIVE-23721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HIVE-23721:
----------------------------------
Labels: pull-request-available (was: )
> MetaStoreDirectSql.ensureDbInit() need to optimize QuerySQL
> -----------------------------------------------------------
>
> Key: HIVE-23721
> URL: https://issues.apache.org/jira/browse/HIVE-23721
> Project: Hive
> Issue Type: Bug
> Affects Versions: 3.1.2
> Environment: Hadoop 3.1(1700+ nodes)
> YARN 3.1 (with timelineserver enabled,https enabled)
> Hive 3.1 (15 HS2 instance)
> 60000+ YARN Applications every day
> Reporter: YulongZ
> Assignee: zhangbutao
> Priority: Critical
> Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-23721.01.patch
>
> Time Spent: 10m
> Remaining Estimate: 0h
>
> From Hive3.0,catalog added to hivemeta,many schema of metastore added column
> “catName”,and index for table added column “catName”。
> In MetaStoreDirectSql.ensureDbInit() ,two queries below
> “
> initQueries.add(pm.newQuery(MTableColumnStatistics.class, "dbName ==
> ''"));
> initQueries.add(pm.newQuery(MPartitionColumnStatistics.class, "dbName
> == ''"));
> ”
> should use "catName == ''" instead of "dbName == ''",because “catName” is the
> first index column。
> When data of metastore become large,for example, table of
> MPartitionColumnStatistics have millions of lines。The
> “newQuery(MPartitionColumnStatistics.class, "dbName == ''")” for metastore
> executed very slowly,and the query “show tables“ for hiveserver2 executed
> very slowly too。
--
This message was sent by Atlassian Jira
(v8.3.4#803005)