[
https://issues.apache.org/jira/browse/HIVE-17051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16076398#comment-16076398
]
Remus Rusanu commented on HIVE-17051:
-------------------------------------
This is a simple query
{noformat}
SELECT DISTINCT * FROM src;
{noformat}
If multiple tables are present (eg. JOIN) each table metadata is requested
twice.
> Each table metadata is requested twice during query compile
> -----------------------------------------------------------
>
> Key: HIVE-17051
> URL: https://issues.apache.org/jira/browse/HIVE-17051
> Project: Hive
> Issue Type: Bug
> Components: Query Planning
> Reporter: Remus Rusanu
> Assignee: Remus Rusanu
> Labels: performance
>
> As far as I can tell, for each table referenced in a query the metadata is
> retrieved twice during compilation:
> first call:
> {noformat}
> at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1320)
> at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1275)
> at
> org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.getTableObjectByName(SemanticAnalyzer.java:10943)
> at
> org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.getMetaData(SemanticAnalyzer.java:1992)
> at
> org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.getMetaData(SemanticAnalyzer.java:1942)
> at
> org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genResolvedParseTree(SemanticAnalyzer.java:11178)
> at
> org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:11309)
> at
> org.apache.hadoop.hive.ql.parse.CalcitePlanner.analyzeInternal(CalcitePlanner.java:295)
> at
> org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:261)
> at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:566)
> {noformat}
> second call:
> {noformat}
> at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1320)
> at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1275)
> at
> org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.getTableObjectByName(SemanticAnalyzer.java:10943)
> at
> org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.getMetaData(SemanticAnalyzer.java:1992)
> at
> org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.getMetaData(SemanticAnalyzer.java:1942)
> at
> org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.getMetaData(SemanticAnalyzer.java:1934)
> at
> org.apache.hadoop.hive.ql.parse.CalcitePlanner.genOPTree(CalcitePlanner.java:431)
> at
> org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:11320)
> at
> org.apache.hadoop.hive.ql.parse.CalcitePlanner.analyzeInternal(CalcitePlanner.java:295)
> at
> org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:261)
> at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:566)
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)