zhangbutao commented on code in PR #4397:
URL: https://github.com/apache/hive/pull/4397#discussion_r1225289195


##########
ql/src/java/org/apache/hadoop/hive/ql/stats/ColStatsProcessor.java:
##########
@@ -220,8 +220,10 @@ public int persistColumnStats(Hive db, Table tbl) throws 
HiveException, MetaExce
       start = System. currentTimeMillis();
       if (tbl != null && tbl.isNonNative() && 
tbl.getStorageHandler().canSetColStatistics(tbl)) {
         tbl.getStorageHandler().setColStatistics(tbl, colStats);
+      } else {
+        // Set table or partition column statistics in metastore.
+        db.setPartitionColumnStatistics(request);
       }
-      db.setPartitionColumnStatistics(request);

Review Comment:
   
https://github.com/apache/hive/blob/10600392ec05bd50351b4668b38dd84502a7eb72/ql/src/java/org/apache/hadoop/hive/ql/optimizer/StatsOptimizer.java#L308-L311
  
   
   
https://github.com/apache/hive/blob/10600392ec05bd50351b4668b38dd84502a7eb72/ql/src/java/org/apache/hadoop/hive/ql/optimizer/StatsOptimizer.java#L932-L950
    Currently, Like this example HIVE-27347 always uses the iceberg basic stats 
from metatstore to optimize` count(*)` query. We should consider how to do this 
if only using puffin stats.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to