[
https://issues.apache.org/jira/browse/HIVE-27409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhangbutao updated HIVE-27409:
------------------------------
Description:
We recently added support for Iceberg statistics, e.g. HIVE-24928 and
HIVE-27158, and Iceberg stats can already be used to optimize some queries,
as in HIVE-27347.
However, the current Hive codebase prohibits using stats of EXTERNAL tables;
this restriction was introduced by HIVE-11266. HIVE-19329 also disabled some
optimizations for EXTERNAL tables, whether they are Iceberg tables or not.
As a result, an Iceberg table of type EXTERNAL cannot use stats to optimize
queries.
In HIVE-24928, we added the method
HiveStorageHandler::canProvideBasicStatistics() to indicate that Iceberg is
able to provide stats. That is, although an Iceberg table is regarded as an
EXTERNAL table in Hive, it can provide detailed statistics.
Therefore, I suggest checking both the table type and the boolean result of
HiveStorageHandler::canProvideBasicStatistics() to determine whether the table
can use stats.
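The proposed check can be illustrated with a minimal sketch. It is not the
actual Hive patch: TableType, StorageHandler, and
StatsEligibilitySketch.canUseStats are hypothetical stand-ins defined here for
illustration, and only the method name canProvideBasicStatistics() comes from
the description above. The sketch also assumes that non-EXTERNAL tables remain
eligible for stats-based optimization, as they are today.

```java
// Stand-in types for illustration only; these are NOT Hive's real classes.
enum TableType { MANAGED_TABLE, EXTERNAL_TABLE }

interface StorageHandler {
    // Mirrors the intent of HiveStorageHandler::canProvideBasicStatistics().
    boolean canProvideBasicStatistics();
}

final class StatsEligibilitySketch {
    private StatsEligibilitySketch() {}

    // Hypothetical helper: decide whether a table's stats may be used.
    // Assumption: non-EXTERNAL tables stay eligible; an EXTERNAL table is
    // eligible only if its storage handler says it can provide basic stats.
    static boolean canUseStats(TableType type, StorageHandler handler) {
        if (type != TableType.EXTERNAL_TABLE) {
            return true;
        }
        // A table without a storage handler has nothing to vouch for its stats.
        return handler != null && handler.canProvideBasicStatistics();
    }

    public static void main(String[] args) {
        StorageHandler icebergLike = () -> true;   // handler that provides stats
        StorageHandler plainExternal = () -> false; // handler that does not

        System.out.println(canUseStats(TableType.EXTERNAL_TABLE, icebergLike));   // true
        System.out.println(canUseStats(TableType.EXTERNAL_TABLE, plainExternal)); // false
        System.out.println(canUseStats(TableType.EXTERNAL_TABLE, null));          // false
        System.out.println(canUseStats(TableType.MANAGED_TABLE, null));           // true
    }
}
```

With this shape, an EXTERNAL table is no longer rejected on its type alone;
the storage handler gets the final say.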
was:
We recently added support for Iceberg statistics, e.g. HIVE-24928 and
HIVE-27158, and Iceberg stats can already be used to optimize some queries,
as in HIVE-27347.
However, the current Hive codebase prohibits using stats of EXTERNAL tables;
this restriction was introduced by HIVE-11266. As a result, an Iceberg table
of type EXTERNAL cannot use stats to optimize queries.
In HIVE-24928, we added the method
HiveStorageHandler::canProvideBasicStatistics() to indicate that Iceberg is
able to provide stats. That is, although an Iceberg table is regarded as an
EXTERNAL table in Hive, it can provide detailed statistics.
Therefore, I suggest checking both the table type and the boolean result of
HiveStorageHandler::canProvideBasicStatistics() to determine whether the table
can use stats.
> Iceberg: table with EXTERNAL type can not use statistics to optimize the query
> ------------------------------------------------------------------------------
>
> Key: HIVE-27409
> URL: https://issues.apache.org/jira/browse/HIVE-27409
> Project: Hive
> Issue Type: Improvement
> Components: Iceberg integration
> Reporter: zhangbutao
> Assignee: zhangbutao
> Priority: Minor
> Labels: pull-request-available