[
https://issues.apache.org/jira/browse/FLINK-33871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated FLINK-33871:
-----------------------------------
Labels: pull-request-available (was: )
> Reduce getTable call for hive client and optimize graph generation time
> -----------------------------------------------------------------------
>
> Key: FLINK-33871
> URL: https://issues.apache.org/jira/browse/FLINK-33871
> Project: Flink
> Issue Type: Improvement
> Reporter: hehuiyuan
> Priority: Major
> Labels: pull-request-available
>
> HiveCatalog.getHiveTable method wastes a lot of time when generate graph,
> because the number of calls is relatively high.
> I have an sql task with over 2000 rows, the HiveCatalog.getHiveTable method
> is called 4879 times , but only six hive tables were used.
> 
> The client.getTable method costs a lot of time.
> 
> There is a statistic that jobmanager interacts with hive when generate graph.
> If One call takes approximately 50 milliseconds ,
> How much time it spends : 4879 * 50 =243950ms = 243.95s = 4min
> We can cache and client.getTable method is only called six times.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)