Shohei Okumiya created HIVE-28765:
-------------------------------------

             Summary: Iceberg: Incorrect partition statistics on time travel + 
partition evolution
                 Key: HIVE-28765
                 URL: https://issues.apache.org/jira/browse/HIVE-28765
             Project: Hive
          Issue Type: Bug
          Components: Iceberg integration
            Reporter: Shohei Okumiya
            Assignee: Shohei Okumiya


HiveIcebergStorageHandler fails to fetch stats of evolved partitions. It 
triggers schema and FS level stats estimation as a fallback and results in 
unreasonable counts.
{code:java}
CREATE TABLE test (key INT, id INT) PARTITIONED BY SPEC (bucket(4, key)) STORED 
BY ICEBERG;
INSERT INTO test VALUES (1, 1), (2, 2), (3, 3), (4, 4);
ALTER TABLE test CREATE TAG version1;ALTER TABLE test SET PARTITION SPEC 
(bucket(256, key));
INSERT INTO test VALUES (1, 5), (2, 6), (3, 7), (4, 8);

ALTER TABLE test CREATE TAG version2;SET hive.fetch.task.conversion=none;
EXPLAIN SELECT * FROM default.test.tag_version1;

+----------------------------------------------------+
|                      Explain                       |
+----------------------------------------------------+
| Plan optimized by CBO.                             |
|                                                    |
| Stage-0                                            |
|   Fetch Operator                                   |
|     limit:-1                                       |
|     Stage-1                                        |
|       Map 1 vectorized                             |
|       File Output Operator [FS_4]                  |
|         Select Operator [SEL_3] (rows=12940 width=8) |
|           Output:["_col0","_col1"]                 |
|           TableScan [TS_0] (rows=12940 width=8)    |
|             default@test,test,Tbl:PARTIAL,Col:COMPLETE,Output:["key","id"] |
|                                                    |
+----------------------------------------------------+{code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to