László Pintér created HIVE-24928:
------------------------------------
Summary: In case of non-native tables use basic statistics from
HiveStorageHandler
Key: HIVE-24928
URL: https://issues.apache.org/jira/browse/HIVE-24928
Project: Hive
Issue Type: Bug
Components: Hive
Affects Versions: 4.0.0
Reporter: László Pintér
Assignee: László Pintér
Fix For: 4.0.0

When running `ANALYZE TABLE ... COMPUTE STATISTICS` or `ANALYZE TABLE
... COMPUTE STATISTICS FOR COLUMNS`, all basic statistics are collected by
the BasicStatsTask class. This class estimates the statistics by scanning
the table's directory.

For non-native tables (Iceberg, HBase), the table directory might also
contain metadata files, which BasicStatsTask would count when calculating
basic stats.
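
To illustrate the over-counting, here is a minimal sketch in plain Java. It
does not use Hive or Hadoop classes: `FileEntry`, `scanAll`, `scanDataOnly`,
and the `data/` and `metadata/` layout are all assumptions made up for this
example, and the file names and sizes are invented.

```java
import java.util.Arrays;
import java.util.List;

public class DirectoryScanSketch {
    // Hypothetical stand-in for one entry of a directory listing.
    static final class FileEntry {
        final String path;
        final long sizeBytes;

        FileEntry(String path, long sizeBytes) {
            this.path = path;
            this.sizeBytes = sizeBytes;
        }
    }

    // Counts every file in the listing, as a naive directory scan would.
    // Returns {numFiles, totalSize}.
    static long[] scanAll(List<FileEntry> files) {
        long numFiles = 0;
        long totalSize = 0;
        for (FileEntry f : files) {
            numFiles++;
            totalSize += f.sizeBytes;
        }
        return new long[] {numFiles, totalSize};
    }

    // Counts only files under the assumed "data/" prefix.
    // Returns {numFiles, totalSize}.
    static long[] scanDataOnly(List<FileEntry> files) {
        long numFiles = 0;
        long totalSize = 0;
        for (FileEntry f : files) {
            if (f.path.startsWith("data/")) {
                numFiles++;
                totalSize += f.sizeBytes;
            }
        }
        return new long[] {numFiles, totalSize};
    }

    // Invented listing: two data files plus two metadata files.
    static List<FileEntry> sampleListing() {
        return Arrays.asList(
            new FileEntry("data/part-0.parquet", 1000L),
            new FileEntry("data/part-1.parquet", 2000L),
            new FileEntry("metadata/v1.metadata.json", 300L),
            new FileEntry("metadata/snap-1.avro", 200L));
    }

    public static void main(String[] args) {
        long[] all = scanAll(sampleListing());
        long[] data = scanDataOnly(sampleListing());
        System.out.println("scan of everything: numFiles=" + all[0] + " totalSize=" + all[1]);
        System.out.println("data files only:    numFiles=" + data[0] + " totalSize=" + data[1]);
    }
}
```

With this invented listing, the naive scan reports 4 files and 3500 bytes,
while the data files account for only 2 files and 3000 bytes.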

Instead of relying on this directory scan, the HiveStorageHandler
implementation should provide the basic statistics.
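
One possible shape for this, sketched in plain Java: ask the handler first,
and fall back to the scanned values only when it cannot supply statistics.
The `StatsCapableHandler` interface and its method names are hypothetical
and are not the actual HiveStorageHandler API. The keys `numFiles`,
`numRows`, and `totalSize` mirror the basic-stats table parameter names used
by Hive; the values are invented.

```java
import java.util.HashMap;
import java.util.Map;

public class HandlerStatsSketch {
    // Hypothetical interface for illustration only.
    interface StatsCapableHandler {
        boolean canProvideBasicStats();
        Map<String, String> basicStats();
    }

    // Prefers handler-provided stats; otherwise returns the scanned stats.
    static Map<String, String> resolveBasicStats(StatsCapableHandler handler,
                                                 Map<String, String> scanned) {
        if (handler != null && handler.canProvideBasicStats()) {
            return handler.basicStats();
        }
        return scanned;
    }

    // Invented handler that knows the stats from its own table metadata.
    static StatsCapableHandler sampleHandler() {
        return new StatsCapableHandler() {
            @Override
            public boolean canProvideBasicStats() {
                return true;
            }

            @Override
            public Map<String, String> basicStats() {
                Map<String, String> stats = new HashMap<>();
                stats.put("numFiles", "2");
                stats.put("numRows", "150");
                stats.put("totalSize", "3000");
                return stats;
            }
        };
    }

    // Invented result of a directory scan that also counted metadata files.
    static Map<String, String> sampleScan() {
        Map<String, String> stats = new HashMap<>();
        stats.put("numFiles", "4");
        stats.put("totalSize", "3500");
        return stats;
    }

    public static void main(String[] args) {
        System.out.println("with handler:    " + resolveBasicStats(sampleHandler(), sampleScan()));
        System.out.println("without handler: " + resolveBasicStats(null, sampleScan()));
    }
}
```

The fallback keeps the existing behavior for native tables, where no
handler is involved, so only non-native tables would change.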