Shohei Okumiya created HIVE-28765:
-------------------------------------
Summary: Iceberg: Incorrect partition statistics on time travel +
partition evolution
Key: HIVE-28765
URL: https://issues.apache.org/jira/browse/HIVE-28765
Project: Hive
Issue Type: Bug
Components: Iceberg integration
Reporter: Shohei Okumiya
Assignee: Shohei Okumiya
HiveIcebergStorageHandler fails to fetch stats of evolved partitions. It
triggers schema and FS level stats estimation as a fallback and results in
unreasonable counts.
{code:java}
CREATE TABLE test (key INT, id INT) PARTITIONED BY SPEC (bucket(4, key)) STORED
BY ICEBERG;
INSERT INTO test VALUES (1, 1), (2, 2), (3, 3), (4, 4);
ALTER TABLE test CREATE TAG version1;ALTER TABLE test SET PARTITION SPEC
(bucket(256, key));
INSERT INTO test VALUES (1, 5), (2, 6), (3, 7), (4, 8);
ALTER TABLE test CREATE TAG version2;SET hive.fetch.task.conversion=none;
EXPLAIN SELECT * FROM default.test.tag_version1;
+----------------------------------------------------+
| Explain |
+----------------------------------------------------+
| Plan optimized by CBO. |
| |
| Stage-0 |
| Fetch Operator |
| limit:-1 |
| Stage-1 |
| Map 1 vectorized |
| File Output Operator [FS_4] |
| Select Operator [SEL_3] (rows=12940 width=8) |
| Output:["_col0","_col1"] |
| TableScan [TS_0] (rows=12940 width=8) |
| default@test,test,Tbl:PARTIAL,Col:COMPLETE,Output:["key","id"] |
| |
+----------------------------------------------------+{code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)