[ 
https://issues.apache.org/jira/browse/HIVE-27185?focusedWorklogId=853891&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-853891
 ]

ASF GitHub Bot logged work on HIVE-27185:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 30/Mar/23 10:21
            Start Date: 30/Mar/23 10:21
    Worklog Time Spent: 10m 
      Work Description: ayushtkn commented on code in PR #4165:
URL: https://github.com/apache/hive/pull/4165#discussion_r1153050938


##########
iceberg/iceberg-handler/src/main/java/org/apache/iceberg/mr/hive/IcebergTableUtil.java:
##########
@@ -52,12 +52,13 @@ private IcebergTableUtil() {
    * Constructs the table properties needed for the Iceberg table loading by 
retrieving the information from the
    * hmsTable. It then calls {@link IcebergTableUtil#getTable(Configuration, 
Properties)} with these properties.
    * @param configuration a Hadoop configuration
-   * @param hmsTable the HMS table
-   * @param skipCache if set to true there won't be an attempt to retrieve the 
table from SessionState
+   * @param hmsTable      the HMS table
+   * @param skipCache     if set to true there won't be an attempt to retrieve 
the table from SessionState
+   * @param suffix        the suffix to use for cache.
    * @return the Iceberg table
    */
   static Table getTable(Configuration configuration, 
org.apache.hadoop.hive.metastore.api.Table hmsTable,
-      boolean skipCache) {
+      boolean skipCache, String suffix) {

Review Comment:
   We want to cache the table for Stats only, if you fetch once for Stats, then 
use that till end, if you fetch for something else, we don't use that.
   
   In most cases that won't be a problem, but the way we do CTAS and MV's it 
did creates problem. It first creates an iceberg table and then does the load 
into that, so in that cases caching would have stale table. Got a test failure 
as well for those MV cases in the last run





Issue Time Tracking
-------------------

    Worklog Id:     (was: 853891)
    Time Spent: 50m  (was: 40m)

> Iceberg: Cache iceberg table while loading for stats
> ----------------------------------------------------
>
>                 Key: HIVE-27185
>                 URL: https://issues.apache.org/jira/browse/HIVE-27185
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Ayush Saxena
>            Assignee: Ayush Saxena
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 50m
>  Remaining Estimate: 0h
>
> Presently iceberg for stats loads the iceberg table multiple times for stats 
> via different routes.
> Cache it to avoid reading/loading the iceberg table multiple times.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to