[
https://issues.apache.org/jira/browse/HIVE-29099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18034235#comment-18034235
]
Dmitriy Fingerman commented on HIVE-29099:
------------------------------------------
I checked on version 4.0.1, the issue exists there, but returns different
results:
for non-iceberg - same,
for iceberg: return 1 and 0 instead of 1 and 2.
> Select query result mismatch in iceberg table vs hive table for the same
> inserted data
> --------------------------------------------------------------------------------------
>
> Key: HIVE-29099
> URL: https://issues.apache.org/jira/browse/HIVE-29099
> Project: Hive
> Issue Type: Bug
> Affects Versions: 4.1.0, 4.0.1
> Reporter: Vikram Ahuja
> Assignee: Vikram Ahuja
> Priority: Major
> Labels: correctness
>
> Hive Table:
>
> drop table tbl_parquet;
> create external table tbl_parquet(a int) partitioned by (b string) stored as
> parquet;
> insert into tbl_parquet values(1,'');
> insert into tbl_parquet values(2,null);
> insert into tbl_parquet values(3, NULL);
> select count(*) from tbl_parquet where b = ''; -> Returns 0
> select count(*) from tbl_parquet where b != ''; -> Returns 3
>
> Iceberg table
> drop table tbl_parquet_iceberg;
> create external table tbl_parquet_iceberg(a int) partitioned by (b string)
> stored by iceberg stored as parquet;
> insert into tbl_parquet_iceberg values(1,'');
> insert into tbl_parquet_iceberg values(2,null);
> insert into tbl_parquet_iceberg values(3, NULL);
> select count(*) from tbl_parquet_iceberg where b = ''; -> Returns 1
> select count(*) from tbl_parquet_iceberg where b != ''; -> Returns 2
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)