[
https://issues.apache.org/jira/browse/HIVE-29099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dmitriy Fingerman updated HIVE-29099:
-------------------------------------
Affects Version/s: 4.0.1
> Select query result mismatch in iceberg table vs hive table for the same
> inserted data
> --------------------------------------------------------------------------------------
>
> Key: HIVE-29099
> URL: https://issues.apache.org/jira/browse/HIVE-29099
> Project: Hive
> Issue Type: Bug
> Affects Versions: 4.1.0, 4.0.1
> Reporter: Vikram Ahuja
> Assignee: Vikram Ahuja
> Priority: Major
> Labels: correctness
>
> Hive Table:
>
> drop table tbl_parquet;
> create external table tbl_parquet(a int) partitioned by (b string) stored as
> parquet;
> insert into tbl_parquet values(1,'');
> insert into tbl_parquet values(2,null);
> insert into tbl_parquet values(3, NULL);
> select count(*) from tbl_parquet where b = ''; -> Returns 0
> select count(*) from tbl_parquet where b != ''; -> Returns 3
>
> Iceberg table
> drop table tbl_parquet_iceberg;
> create external table tbl_parquet_iceberg(a int) partitioned by (b string)
> stored by iceberg stored as parquet;
> insert into tbl_parquet_iceberg values(1,'');
> insert into tbl_parquet_iceberg values(2,null);
> insert into tbl_parquet_iceberg values(3, NULL);
> select count(*) from tbl_parquet_iceberg where b = ''; -> Returns 1
> select count(*) from tbl_parquet_iceberg where b != ''; -> Returns 2
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)